Adapting Definition Modeling for New Languages: A Case Study on Belarusian

Daniela Kazakouskaya, Timothee Mickus, Janine Siewert


Abstract
Definition modeling, the task of generating new definitions for words in context, holds great prospect as a means to assist the work of lexicographers in documenting a broader variety of lects and languages, yet much remains to be done in order to assess how we can leverage pre-existing models for as-of-yet unsupported languages. In this work, we focus on adapting existing models to Belarusian, for which we propose a novel dataset of 43,150 definitions. Our experiments demonstrate that adapting a definition modeling systems requires minimal amounts of data, but that there currently are gaps in what automatic metrics do capture.
Anthology ID:
2025.bsnlp-1.8
Volume:
Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Jakub Piskorski, Pavel Přibáň, Preslav Nakov, Roman Yangarber, Michal Marcinczuk
Venues:
BSNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
69–75
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bsnlp-1.8/
DOI:
Bibkey:
Cite (ACL):
Daniela Kazakouskaya, Timothee Mickus, and Janine Siewert. 2025. Adapting Definition Modeling for New Languages: A Case Study on Belarusian. In Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025), pages 69–75, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Adapting Definition Modeling for New Languages: A Case Study on Belarusian (Kazakouskaya et al., BSNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bsnlp-1.8.pdf