Abstract
In this paper, we introduce the first dependency treebank for Umbrian, an extinct Indo-European language of the Italic family once spoken in modern-day Italy. We present the source of the corpus: a set of seven bronze tablets describing religious ceremonies, written in two different scripts and unearthed in Umbria in the 15th century. The corpus has already been studied extensively by specialists in Old Italic and classical Indo-European languages. We therefore discuss a number of challenges encountered while annotating the corpus according to the Universal Dependencies guidelines on the basis of existing textual analyses.
- Anthology ID:
- 2022.lt4hala-1.6
- Volume:
- Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Editors:
- Rachele Sprugnoli, Marco Passarotti
- Venue:
- LT4HALA
- Publisher:
- European Language Resources Association
- Pages:
- 38–42
- URL:
- https://aclanthology.org/2022.lt4hala-1.6
- Cite (ACL):
- Mathieu Dehouck. 2022. The IKUVINA Treebank. In Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages, pages 38–42, Marseille, France. European Language Resources Association.
- Cite (Informal):
- The IKUVINA Treebank (Dehouck, LT4HALA 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2022.lt4hala-1.6.pdf