Abstract
This paper reports on the analysis and annotation of Multiword Expressions in the Irish Universal Dependency Treebank. We provide a linguistic discussion around decisions on how to appropriately label Irish MWEs using the compound, flat and fixed dependency relation labels within the framework of the Universal Dependencies annotation guidelines. We discuss some nuances of the Irish language that pose challenges for assigning these UD labels and provide this report in support of the Irish UD annotation guidelines. With this we hope to ensure consistency in annotation across the dataset and provide a basis for future MWE annotation for Irish.
- Anthology ID:
- 2020.udw-1.15
- Volume:
- Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020)
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona, Spain (Online)
- Venue:
- UDW
- Publisher:
- Association for Computational Linguistics
- Pages:
- 126–139
- URL:
- https://aclanthology.org/2020.udw-1.15
- Cite (ACL):
- Sarah McGuinness, Jason Phelan, Abigail Walsh, and Teresa Lynn. 2020. Annotating MWEs in the Irish UD Treebank. In Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020), pages 126–139, Barcelona, Spain (Online). Association for Computational Linguistics.
- Cite (Informal):
- Annotating MWEs in the Irish UD Treebank (McGuinness et al., UDW 2020)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2020.udw-1.15.pdf
- Data
- Universal Dependencies