Abstract
We present a Universal Dependencies (UD) treebank for Highland Puebla Nahuatl. The treebank is only the second such UD corpus for a Mexican language, and supplements an existing treebank for another Nahuatl variant. We describe the process of data collection, annotation decisions and interesting syntactic constructions, and discuss some similarities and differences between the Highland Puebla Nahuatl treebank and the existing Western Sierra Puebla Nahuatl treebank.
- Anthology ID:
- 2024.naacl-long.76
- Volume:
- Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
- Month:
- June
- Year:
- 2024
- Address:
- Mexico City, Mexico
- Editors:
- Kevin Duh, Helena Gomez, Steven Bethard
- Venue:
- NAACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 1393–1403
- URL:
- https://aclanthology.org/2024.naacl-long.76
- Cite (ACL):
- Robert Pugh and Francis Tyers. 2024. A Universal Dependencies Treebank for Highland Puebla Nahuatl. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 1393–1403, Mexico City, Mexico. Association for Computational Linguistics.
- Cite (Informal):
- A Universal Dependencies Treebank for Highland Puebla Nahuatl (Pugh & Tyers, NAACL 2024)
- PDF:
- https://preview.aclanthology.org/fix-volume-bibkeys/2024.naacl-long.76.pdf