2025
Tuebingen at SemEval-2025 Task 10: Class Weighting, External Knowledge and Data Augmentation in BERT Models
Özlem Karabulut | Soudabeh Eslami | Ali Gharaee | Matthew Andrews
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
The spread of disinformation and propaganda in online news presents a significant challenge to information integrity. As part of SemEval-2025 Task 10 on Multilingual Characterization and Extraction of Narratives from Online News, this study focuses on Subtask 1: Entity Framing, which involves assigning roles to named entities within news articles across multiple languages. We investigate techniques such as data augmentation, external knowledge, and class weighting to improve classification performance. Our findings indicate that class weighting was more effective than the other approaches.
Parallel Universal Dependencies Treebanks for Turkic Languages
Arofat Akhundjanova | Furkan Akkurt | Bermet Chontaeva | Soudabeh Eslami | Cagri Coltekin
Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)
We introduce the first fully aligned and manually annotated parallel Universal Dependencies (UD) treebanks for four Turkic languages: Azerbaijani, Kyrgyz, Turkish, and Uzbek. These resources currently consist of 148 strategically selected sentences that illustrate typologically significant morphosyntactic phenomena across these related yet distinct languages. These parallel treebanks enable systematic comparative studies of Turkic syntax and may be instrumental in cross-lingual NLP applications. All treebanks are available as part of UD v2.16.
2024
Strategies for the Annotation of Pronominalised Locatives in Turkic Universal Dependency Treebanks
Jonathan Washington | Çağrı Çöltekin | Furkan Akkurt | Bermet Chontaeva | Soudabeh Eslami | Gulnura Jumalieva | Aida Kasieva | Aslı Kuzgun | Büşra Marşan | Chihiro Taguchi
Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024
As part of our efforts to develop unified Universal Dependencies (UD) guidelines for Turkic languages, we evaluate multiple approaches to a difficult morphosyntactic phenomenon: pronominal locative expressions formed by the suffix -ki. These forms result in multiple syntactic words that have potentially conflicting morphological features and participate in different dependency relations. We describe multiple approaches to the problem in current (and upcoming) Turkic UD treebanks, and show that none of them offers a solution that satisfies all of the constraints we consider (including constraints imposed by UD guidelines). This calls for a compromise with the ‘least damage’ that should be adopted by most, if not all, Turkic treebanks. Our discussion of the phenomenon and various annotation approaches may also help treebanking efforts for other languages or language families with similar constructions.