The IKUVINA Treebank

Mathieu Dehouck


Abstract
In this paper, we introduce the first dependency treebank for the Umbrian language (an extinct Indo-European language from the Italic family, once spoken in modern day Italy). We present the source of the corpus : a set of seven bronze tablets describing religious ceremonies, written using two different scripts, unearthed in Umbria in the XVth century. The corpus itself has already been studied extensively by specialists of old Italic and classical Indo-European languages. So we discuss a number of challenges that we encountered as we annotated the corpus following Universal Dependencies’ guidelines from existing textual analyses.
Anthology ID:
2022.lt4hala-1.6
Volume:
Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Rachele Sprugnoli, Marco Passarotti
Venue:
LT4HALA
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
38–42
Language:
URL:
https://aclanthology.org/2022.lt4hala-1.6
DOI:
Bibkey:
Cite (ACL):
Mathieu Dehouck. 2022. The IKUVINA Treebank. In Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages, pages 38–42, Marseille, France. European Language Resources Association.
Cite (Informal):
The IKUVINA Treebank (Dehouck, LT4HALA 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/2022.lt4hala-1.6.pdf