Generative Multilingual Coreference Resolution at CRAC 2026

Jakub Hejman, Ondrej Prazak, Miloslav Konopík


Abstract
Participating again in this year’s edition of the CRAC shared task on coreference resolution, we present our upgraded system with an official uplift of 15.46 percentage points in CoNLL-U score. We incorporated the larger Gemma 3 27B IT model, joint pre-training, headword tagging, more efficient training and inference as well as a sliding window to achieve this result. Our system placed second in the LLM track and third overall with a primary score of 73.83. We reached the highest scores on two datasets. Finally, we compare specialized and general LLM approaches.
Anthology ID:
2026.codi-1.22
Volume:
Proceedings of the 2nd Joint Workshop on Computational Approaches to Discourse, Context and Document-Level Inferences and Computational Models of Reference, Anaphora and Coreference (CODI-CRAC 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Chloé Braud, Christian Hardmeier, Maciej Ogrodniczuk, Sharid Loaiciga, Amir Zeldes, Michal Novák, Chuyuan Li, Michael Strube, Junyi Jessy Li
Venues:
CODI | CRAC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
162–166
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.codi-1.22/
DOI:
Bibkey:
Cite (ACL):
Jakub Hejman, Ondrej Prazak, and Miloslav Konopík. 2026. Generative Multilingual Coreference Resolution at CRAC 2026. In Proceedings of the 2nd Joint Workshop on Computational Approaches to Discourse, Context and Document-Level Inferences and Computational Models of Reference, Anaphora and Coreference (CODI-CRAC 2026), pages 162–166, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Generative Multilingual Coreference Resolution at CRAC 2026 (Hejman et al., CODI-CRAC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.codi-1.22.pdf