Ulf A. Hamster


Fixing paper assignments

  1. Please select all papers that belong to the same person.
  2. Indicate below which author they should be assigned to.
Provide a valid ORCID iD here. This will be used to match future papers to this author.
Provide the name of the school or the university where the author has received or will receive their highest degree (e.g., Ph.D. institution for researchers, or current affiliation for students). This will be used to form the new author page ID, if needed.

TODO: "submit" and "cancel" buttons here


2022

pdf bib
Everybody likes short sentences - A Data Analysis for the Text Complexity DE Challenge 2022
Ulf A. Hamster
Proceedings of the GermEval 2022 Workshop on Text Complexity Assessment of German Text

The German Text Complexity Assessment Shared Task in KONVENS 2022 explores how to predict a complexity score for sentence examples from language learners’ perspective. Our modeling approach for this shared task utilizes off-the-shelf NLP tools for feature engineering and a Random Forest regression model. We identified the text length, or resp. the logarithm of a sentence’s string length, as the most important feature to predict the complexity score. Further analysis showed that the Pearson correlation between text length and complexity score is about 𝜌 ≈ 0.777. A sensitivity analysis on the loss function revealed that semantic SBert features impact the complexity score as well.
Search
Co-authors
    Venues
    Fix data