Morphological Parsing for Media Lengua: When Accessibility Matters More Than State-of-the-Art

Jesse Stewart, Olga Kriukova


Abstract
While machine learning approaches dominate contemporary NLP research, a critical gap exists between published models and tools actually used by target communities (Gessler & von der Wense, 2024). This paper presents two morphological parsers for Media Lengua (ISO 639-3: mue), an endangered mixed language of Ecuador, demonstrating that a JavaScript rule-based system (98.6% accuracy) can outperform a CRF model (95.7% F1) while offering immediate community accessibility.Not all language structures permit straightforward rule-based parsing; however, when a language’s morphology allows for this approach with competitive accuracy, we argue that it should be preferred for its practical advantages: immediate browser-based deployment, transparency, zero infrastructure requirements, and long-term maintainability. Our rule-based parser runs entirely in the browser, is freely available online, and can be adapted to other Quechuan languages. In contrast, while the CRF model performs well on benchmarks, it requires additional infrastructure to become accessible.Our comparison highlights the need to evaluate NLP tools not only on accuracy metrics but also on accessibility and real-world adoption, which is particularly crucial for endangered language communities where sustainable, community-accessible tools can support language documentation, education, and revitalization.
Anthology ID:
2026.computel-1.1
Volume:
Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–9
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.1/
DOI:
Bibkey:
Cite (ACL):
Jesse Stewart and Olga Kriukova. 2026. Morphological Parsing for Media Lengua: When Accessibility Matters More Than State-of-the-Art. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 1–9, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Morphological Parsing for Media Lengua: When Accessibility Matters More Than State-of-the-Art (Stewart & Kriukova, ComputEL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.1.pdf
Supplementarymaterial:
 2026.computel-1.1.SupplementaryMaterial.txt