Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support
Changbing Yang, Pt Anderson, Godfred Agyapong, Sarah Moeller
Abstract
Annotation tools are foundational infrastructure for language documentation, yet few comprehensive surveys have evaluated the tool landscape specifically from a documentary linguistics perspective. We survey 98 annotation tools across dimensions critical to language documentation workflows: annotation support, collaboration features, active learning, cost and openness, and institutional sustainability. Of the 44 tools both free and accessible for evaluation, only 15 support morpheme segmentation and glossing, and only 6 combine morphological annotation with remote collaboration at no cost. We identify a structural gap between the current tools and the requirements of field linguists working with endangered and Indigenous languages. While many NLP tools prioritize scalable annotation for high-resource settings, documentary linguists need interlinear glossed text (IGT) support and community-accessible interfaces. We taxonomise the tool landscape, present a multi-dimensional feature matrix, suggest current tools for language documentation, and conclude with concrete recommendations for tool developers and the documentary linguistics community.- Anthology ID:
- 2026.computel-1.9
- Volume:
- Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
- Venues:
- ComputEL | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 80–92
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.9/
- DOI:
- Cite (ACL):
- Changbing Yang, Pt Anderson, Godfred Agyapong, and Sarah Moeller. 2026. Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 80–92, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- Annotation Tools for Language Documentation: A Survey of Capabilities, Gaps, and Morphological Support (Yang et al., ComputEL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.9.pdf