Annotating MWEs in the Irish UD Treebank

Sarah McGuinness, Jason Phelan, Abigail Walsh, Teresa Lynn


Abstract
This paper reports on the analysis and annotation of Multiword Expressions in the Irish Universal Dependency Treebank. We provide a linguistic discussion around decisions on how to appropri- ately label Irish MWEs using the compound, flat and fixed dependency relation labels within the framework of the Universal Dependencies annotation guidelines. We discuss some nuances of the Irish language that pose challenges for assigning these UD labels and provide this report in support of the Irish UD annotation guidelines. With this we hope to ensure consistency in annotation across the dataset and provide a basis for future MWE annotation for Irish.
Anthology ID:
2020.udw-1.15
Volume:
Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020)
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Venue:
UDW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
126–139
Language:
URL:
https://aclanthology.org/2020.udw-1.15
DOI:
Bibkey:
Cite (ACL):
Sarah McGuinness, Jason Phelan, Abigail Walsh, and Teresa Lynn. 2020. Annotating MWEs in the Irish UD Treebank. In Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020), pages 126–139, Barcelona, Spain (Online). Association for Computational Linguistics.
Cite (Informal):
Annotating MWEs in the Irish UD Treebank (McGuinness et al., UDW 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.udw-1.15.pdf
Data
Universal Dependencies