IndigiEval: Evaluating LLMs in North American Indigenous Languages

Julia Mainzinger, Jacqueline Brixey


Abstract
This paper presents IndigiEval, a framework for evaluating the language and cultural proficiency of several commercially available large language models (LLMs) across five North American Indigenous languages (Mvskoke, Choctaw, Cherokee, Cheyenne, and Hawaiian). The framework is a qualitative evaluation method intended to let communities with small speaker populations critically evaluate LLM performance with minimal data and human effort. IndigiEval includes tasks such as answering cultural questions, translation, text generation, and speech recognition. The results of our experiments indicate that no currently available LLM performs well across all evaluation categories, and that LLMs frequently hallucinate orthographies, grammatical structures, cultural knowledge, and vocabulary for all languages and cultures considered. Our proposed evaluation framework is intended not as a comprehensive score but as a qualitative and flexible way to inform language communities about a given LLM’s potential as a resource, since each language has its own environment, strengths, and available resources.
Anthology ID:
2026.americasnlp-6.8
Volume:
Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Manuel Mager, Abteen Ebrahimi, Minh Duc Bui, Robert Pugh, Arturo Oncevay, Luis Chiruzzo, Rolando Coto Solano, Shruti Rijhwani, Katharina Von Der Wense
Venues:
AmericasNLP | WS
Publisher:
Association for Computational Linguistics
Pages:
82–94
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.americasnlp-6.8/
Cite (ACL):
Julia Mainzinger and Jacqueline Brixey. 2026. IndigiEval: Evaluating LLMs in North American Indigenous Languages. In Proceedings of the Sixth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP), pages 82–94, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
IndigiEval: Evaluating LLMs in North American Indigenous Languages (Mainzinger & Brixey, AmericasNLP 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.americasnlp-6.8.pdf
Supplementary material:
 2026.americasnlp-6.8.SupplementaryMaterial.zip