Low-Resource Methods for Hawaiian Machine Translation

Nolan Brophy, Winston Wu


Abstract
This paper investigates the challenges of low-resource machine translation for ʻŌlelo Hawaiʻi (Hawaiian), a critically endangered Polynesian language. We compile a corpus of publicly available Hawaiian-English bitext and investigate the effectiveness of neural sequence-to-sequence models and large language models for translating Hawaiian. To address data scarcity, we employ various data augmentation techniques, including backtranslation, multilingual training using parallel corpora in related languages, and leveraging dictionary entries. Our experiments demonstrate that multilingual training significantly improves model performance, particularly when incorporating bitext from related Polynesian languages. Fine-tuned large language models were not able to outperform mBART, highlighting that smaller and simpler models are still relevant, especially in low-resource scenarios.
Anthology ID:
2026.computel-1.11
Volume:
Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Godfred Agyapong, Sarah Moeller, Antti Arppe, Ali Marashian, Daisy Rosenblum
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
104–110
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.11/
DOI:
Bibkey:
Cite (ACL):
Nolan Brophy and Winston Wu. 2026. Low-Resource Methods for Hawaiian Machine Translation. In Proceedings of the Ninth Workshop on the Use of Computational Methods in the Study of Endangered Languages (ComputEL-9), pages 104–110, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Low-Resource Methods for Hawaiian Machine Translation (Brophy & Wu, ComputEL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.computel-1.11.pdf
Supplementarymaterial:
 2026.computel-1.11.SupplementaryMaterial.txt