Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support

Changbing Yang, Pt Anderson, Godfred Agyapong, Sarah Moeller


Abstract
Annotation tools are foundational infrastructure for language documentation, yet few comprehensive surveys have evaluated the tool landscape specifically from a documentary linguistics perspective. We survey 98 annotation tools across dimensions critical to language documentation workflows: annotation support, collaboration features, active learning, cost and openness, and institutional sustainability. Of the 44 tools both free and accessible for evaluation, only 15 support morpheme segmentation and glossing, and only 6 combine morphological annotation with remote collaboration at no cost. We identify a structural gap between the current tools and the requirements of field linguists working with endangered and Indigenous languages. While many NLP tools prioritize scalable annotation for high-resource settings, documentary linguists need interlinear glossed text (IGT) support and community-accessible interfaces. We taxonomise the tool landscape, present a multi-dimensional feature matrix, suggest current tools for language documentation, and conclude with concrete recommendations for tool developers and the documentary linguistics community.
Anthology ID:
2026.computel-1.9
Volume:
Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
80–92
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.9/
DOI:
Bibkey:
Cite (ACL):
Changbing Yang, Pt Anderson, Godfred Agyapong, and Sarah Moeller. 2026. Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 80–92, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support (Yang et al., ComputEL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.9.pdf
Supplementarymaterial:
 2026.computel-1.9.SupplementaryMaterial.txt