Universal Dependencies for Amahuaca

Candy Angulo, Pilar Valenzuela, Roberto Zariquiey


Abstract
This paper presents the creation of a Universal Dependency (UD) treebank for Amahuaca (Peru), marking the first UD treebank within the Headwaters subbranch of the Panoan family, spoken mostly in Peru and Brazil. While the UD guidelines provided a general framework for our annotations, language-specific decisions were necessary due to the rich morphology of the Amahuaca language. The paper also describes specific constructions to initiate a discussion on several general UD annotation guidelines, particularly those concerning clitics and morpheme-level dependencies.
Anthology ID:
2025.computel-main.17
Volume:
Proceedings of the Eight Workshop on the Use of Computational Methods in the Study of Endangered Languages
Month:
March
Year:
2025
Address:
Honolulu, Hawaii, USA
Editors:
Jordan Lachler, Godfred Agyapong, Antti Arppe, Sarah Moeller, Aditi Chaudhary, Shruti Rijhwani, Daisy Rosenblum
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
150–154
Language:
URL:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.computel-main.17/
DOI:
Bibkey:
Cite (ACL):
Candy Angulo, Pilar Valenzuela, and Roberto Zariquiey. 2025. Universal Dependencies for Amahuaca. In Proceedings of the Eight Workshop on the Use of Computational Methods in the Study of Endangered Languages, pages 150–154, Honolulu, Hawaii, USA. Association for Computational Linguistics.
Cite (Informal):
Universal Dependencies for Amahuaca (Angulo et al., ComputEL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/Ingest-2025-COMPUTEL/2025.computel-main.17.pdf