Vanessa Toborek




2025

Beyond Shallow Heuristics: Leveraging Human Intuition for Curriculum Learning
Vanessa Toborek | Sebastian Müller | Tim Selbach | Tamás Horváth | Christian Bauckhage
Proceedings of the 8th International Conference on Natural Language and Speech Processing (ICNLSP-2025)

2023

A New Aligned Simple German Corpus
Vanessa Toborek | Moritz Busch | Malte Boßert | Christian Bauckhage | Pascal Welke
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

“Leichte Sprache”, the German counterpart to Simple English, is a regulated language aiming to facilitate complex written language that would otherwise stay inaccessible to different groups of people. We present a new sentence-aligned monolingual corpus for Simple German – German. It contains multiple document-aligned sources which we have aligned using automatic sentence-alignment methods. We evaluate our alignments based on a manually labelled subset of aligned documents. The quality of our sentence alignments, as measured by the F1-score, surpasses previous work. We publish the dataset under CC BY-SA and the accompanying code under MIT license.