Development of Old Irish Lexical Resources, and Two Universal Dependencies Treebanks for Diplomatically Edited Old Irish Text

Adrian Doyle, John McCrae


Abstract
The quantity and variety of Old Irish text surviving in contemporary manuscripts, that is, manuscripts dating from the Old Irish period, is quite small compared with what is available for Modern Irish, not to mention better-resourced modern languages. As there have been no native speakers for more than a millennium, no further text will ever be created by native speakers. For these reasons, text surviving in contemporary sources is particularly valuable. Ideally, all such text would be annotated using a single, common standard to ensure compatibility. At present, discrete Old Irish text repositories use mutually incompatible annotation styles, few of which are used by text resources for other languages. This limits the potential for using text from more than one resource simultaneously in NLP applications, or as a basis for creating further resources. This paper describes the production of the first Old Irish text resources designed specifically to ensure lexical compatibility and interoperability.
Anthology ID:
2025.nlp4dh-1.34
Volume:
Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities
Month:
May
Year:
2025
Address:
Albuquerque, USA
Editors:
Mika Hämäläinen, Emily Öhman, Yuri Bizzoni, So Miyagawa, Khalid Alnajjar
Venues:
NLP4DH | WS
Publisher:
Association for Computational Linguistics
Pages:
393–402
URL:
https://preview.aclanthology.org/landing_page/2025.nlp4dh-1.34/
Cite (ACL):
Adrian Doyle and John McCrae. 2025. Development of Old Irish Lexical Resources, and Two Universal Dependencies Treebanks for Diplomatically Edited Old Irish Text. In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pages 393–402, Albuquerque, USA. Association for Computational Linguistics.
Cite (Informal):
Development of Old Irish Lexical Resources, and Two Universal Dependencies Treebanks for Diplomatically Edited Old Irish Text (Doyle & McCrae, NLP4DH 2025)
PDF:
https://preview.aclanthology.org/landing_page/2025.nlp4dh-1.34.pdf