Bridging Digital Tools for Linguistic Documentation and Revitalization
Christopher Haberland, Carly Crowther, Jingnong Qu, Anuk Centellas
Abstract
Digital tools serving language revitalization tend to fall into two categories: 1) linguist-oriented documentation tools that prioritize annotation, morphological analysis, and archival preservation, and 2) community-facing applications that emphasize accessibility and language learning. Few systems integrate the former with the latter, and practical barriers — including the cost of computational expertise, single-user workflows, and limited data governance — further constrain their utility. These disconnects incur additional development and communication costs for revitalization teams consisting of linguists and community members. We introduce "langlit", a collaborative web-based platform that attempts to tailor documentation workflows for the language revitalization context within a single system. The platform integrates a finite-state morphological analyzer with a three-tier human-in-the-loop annotation workflow, searchable corpus interfaces with multiple query modalities, interactive word construction guided by the morphological grammar, corpus-linked hypothesis tracking with provenance, and a grammar-derived editable dictionary. All components share a single underlying FST grammar, and the system supports configurable access controls, collaborative editing, and optional LLM integration with transparent data handling. Designed for redeployment across languages through a modular architecture, "langlit" is published as an open-source repository on GitHub. We situate our system within the existing landscape of revitalization tools through a comparative analysis and discuss how integrated, community-informed design can better serve the specific goals of language revitalization.- Anthology ID:
- 2026.americasnlp-6.3
- Volume:
- Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Manuel Mager, Abteen Ebrahimi, Minh Duc Bui, Robert Pugh, Arturo Oncevay, Luis Chiruzzo, Rolando Coto Solano, Shruti Rijhwani, Katharina Von Der Wense
- Venues:
- AmericasNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 22–32
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.americasnlp-6.3/
- DOI:
- Cite (ACL):
- Christopher Haberland, Carly Crowther, Jingnong Qu, and Anuk Centellas. 2026. Bridging Digital Tools for Linguistic Documentation and Revitalization. In Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP), pages 22–32, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- Bridging Digital Tools for Linguistic Documentation and Revitalization (Haberland et al., AmericasNLP 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.americasnlp-6.3.pdf