Choosing an ASR model for Dënë Sųłıné: Navigating polysynthesis and unstandardized orthography

Olga Kriukova; Antti Arppe; Olga Lovick

Choosing an ASR model for Dënë Sųłıné: Navigating polysynthesis and unstandardized orthography

Abstract

While several pre-trained multilingual models are actively used for fine-tuning on under-resourced and endangered languages, it remains unclear which architectures perform better and what factors explain their varying performance across languages. Although this question may be less pressing for languages with adequate resources, it is critical for endangered language communities, where limited time and funding to experiment with multiple model options are available (Jimerson et al., 2023). We compare the performance of two ASR architectures, Wav2Vec2 and Whisper, on a Dënë Sųłıné dataset. This language and dataset present several challenges common to under-resourced and endangered languages: unstandardized orthography, pronunciation variation, and phonological and morphosyntactic structures that differ from the major languages represented in the multilingual datasets used for pre-training large ASR models. Although Wav2Vec2 reportedly outperforms Whisper in low-resource settings (see e.g., Coto-Solano et al., 2024; Nahabwe et al., 2025; Williams et al., 2023), our study shows that Whisper yields significantly better results on the Dënë Sųłıné dataset. These findings suggest that model performance may depend not only on architecture, dataset size, or typological features of language, but also on dataset-specific characteristics. In our case, Whisper showed better adaptability to a dataset with inconsistent spelling and pronunciation. Further verification across similarly inconsistent datasets is required to assess the generalizability of this result.

Anthology ID:: 2026.computel-1.3
Volume:: Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
Venues:: ComputEL | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15–25
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.3/
DOI:
Bibkey:
Cite (ACL):: Olga Kriukova, Antti Arppe, and Olga Lovick. 2026. Choosing an ASR model for Dënë Sųłıné: Navigating polysynthesis and unstandardized orthography. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 15–25, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Choosing an ASR model for Dënë Sųłıné: Navigating polysynthesis and unstandardized orthography (Kriukova et al., ComputEL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.3.pdf
Supplementarymaterial:: 2026.computel-1.3.SupplementaryMaterial.txt

PDF Cite Search Supplementarymaterial Fix data