Noa Solissa
Also published as: Noa Visser Solissa
2026
GOLEMcoref: A Multilingual Coreference Dataset of Fiction
Andreas Van Cranenburgh | Xiaoyan Yang | Alvanita | Cecilia Nicole Di Domenico | Maria Ferragud | Arianna Graciotti | Byungjun Kim | Seonyeong Park | Noa Visser Solissa | Xiaoyu Zhou | Federico Pianzola
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Andreas Van Cranenburgh | Xiaoyan Yang | Alvanita | Cecilia Nicole Di Domenico | Maria Ferragud | Arianna Graciotti | Byungjun Kim | Seonyeong Park | Noa Visser Solissa | Xiaoyu Zhou | Federico Pianzola
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
We present a multilingual coreference dataset of 827k tokens of fiction in 7 languages: Bahasa Indonesia, Chinese, Dutch, English, Italian, Korean, and Spanish. The dataset includes full stories of diverse lengths, ranging from 500 to 17k words. We discuss our annotation scheme focusing on characters and language-specific challenges we encountered. Finally we present evaluation results of a neural coreference system trained on our dataset. We show that jointly training a system across all languages provides a strong improvement over monolingually trained models. The dataset is available under a creative commons license in CoNLL-2012 and CorefUD format at https://github.com/GOLEM-lab/GOLEMcoref/
2024
GroningenAnnotatesGaza at the FIGNEWS 2024 Shared Task: Analyzing Bias in Conflict Narratives
Khalid Khatib | Sara Gemelli | Saskia Heisterborg | Pritha Majumdar | Gosse Minnema | Arianna Muti | Noa Solissa
Proceedings of the Second Arabic Natural Language Processing Conference
Khalid Khatib | Sara Gemelli | Saskia Heisterborg | Pritha Majumdar | Gosse Minnema | Arianna Muti | Noa Solissa
Proceedings of the Second Arabic Natural Language Processing Conference
In this paper we report the development of our annotation methodology for the shared task FIGNEWS 2024. The objective of the shared task is to look into the layers of bias in how the war on Gaza is represented in media narrative. Our methodology follows the prescriptive paradigm, in which guidelines are detailed and refined through an iterative process in which edge cases are discussed and converged. Our IAA score (Krippendorff’s 𝛼) is 0.420, highlighting the challenging and subjective nature of the task. Our results show that 52% of posts were unbiased, 42% biased against Palestine, 5% biased against Israel, and 3% biased against both. 16% were unclear or not applicable.