Goidelex: A Lexical Resource for Old Irish

Cormac Anderson, Sacha Beniamine, Theodorus Fransen


Abstract
We introduce Goidelex, a new lexical database resource for Old Irish. Goidelex is an openly accessible relational database in CSV format, linked by formal relationships. The launch version documents 695 headwords with extensive linguistic annotations, including orthographic forms using a normalised orthography, automatically generated phonemic transcriptions, and information about morphosyntactic features, such as gender, inflectional class, etc. Metadata in JSON format, following the Frictionless standard, provides detailed descriptions of the tables and dataset. The database is designed to be fully compatible with the Paralex and CLDF standards and is interoperable with existing lexical resources for Old Irish such as CorPH and eDIL. It is suited to both qualitative and quantitative investigation into Old Irish morphology and lexicon, as well as to comparative research. This paper outlines the creation process, rationale, and resulting structure of the database.
Anthology ID:
2024.lt4hala-1.1
Volume:
Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Rachele Sprugnoli, Marco Passarotti
Venues:
LT4HALA | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
1–10
Language:
URL:
https://aclanthology.org/2024.lt4hala-1.1
DOI:
Bibkey:
Cite (ACL):
Cormac Anderson, Sacha Beniamine, and Theodorus Fransen. 2024. Goidelex: A Lexical Resource for Old Irish. In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pages 1–10, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Goidelex: A Lexical Resource for Old Irish (Anderson et al., LT4HALA-WS 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/2024.lt4hala-1.1.pdf