A hybrid Approach to low-resource machine translation for Ojibwe verbs

Minh Nguyen, Christopher Hammerly, Miikka Silfverberg


Abstract
Machine translation is a tool that can help teachers, learners, and users of low-resourced languages. However, there are significant challenges in developing these tools, such as the lack of large-scale parallel corpora and complex morphology. We propose a novel hybrid system that combines LLM and rule-based methods in two distinct stages to translate inflected Ojibwe verbs into English. We use an LLM to automatically annotate dictionary data to build translation templates. Then, our rule-based module performs translation using inflection and slot-filling processes built on top of an FST-based analyzer. We evaluate the system with a set of automated tests. Thanks to the ahead-of-time nature of the template-building process and the lightweight rule-based translation module, the end-to-end translation process has an average translation speed of 70 milliseconds per word. The system achieved an average ChrF score of 0.82 and a semantic similarity score of 0.93 among the successfully translated verbs in a test set. The approach has the potential to be extended to other low-resource Indigenous languages with dictionary data.
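The second stage described in the abstract (inflection plus slot-filling over a morphological analysis) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Analysis` record, the `TEMPLATES` store, the slot names `{subj}` and `{verb}`, and the toy English inflection rule are all hypothetical, and the single entry for nibaa (glossed here as "sleep") is only an illustrative example. A real system would take its analyses from the FST-based analyzer and its templates from the LLM-annotated dictionary data.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Analysis:
    """Hypothetical analyzer output; a real FST would emit its own tag set."""
    lemma: str    # dictionary form of the Ojibwe verb
    person: int   # 1, 2, or 3
    plural: bool  # True for plural subjects


# Hypothetical template store, assumed to be built ahead of time.
# Each entry pairs a slot template with the English verb to inflect.
TEMPLATES = {"nibaa": ("{subj} {verb}", "sleep")}

# English subject pronouns keyed by (person, plural).
SUBJECTS = {
    (1, False): "I", (1, True): "we",
    (2, False): "you", (2, True): "you all",
    (3, False): "s/he", (3, True): "they",
}


def inflect_english(verb: str, person: int, plural: bool) -> str:
    """Toy present-tense rule: add -s for third person singular only."""
    if person == 3 and not plural:
        return verb + "s"
    return verb


def translate(analysis: Analysis) -> Optional[str]:
    """Fill the template slots, or return None if no template exists."""
    entry = TEMPLATES.get(analysis.lemma)
    if entry is None:
        return None  # untranslatable verb: no template for this lemma
    template, verb = entry
    subject = SUBJECTS[(analysis.person, analysis.plural)]
    inflected = inflect_english(verb, analysis.person, analysis.plural)
    return template.format(subj=subject, verb=inflected)


print(translate(Analysis("nibaa", 1, False)))
print(translate(Analysis("nibaa", 3, False)))
```

Because the templates are assumed to be prepared in advance, translation in this sketch is a dictionary lookup followed by string formatting, which is consistent with the low per-word latency the abstract attributes to the ahead-of-time design. Returning `None` for a lemma without a template mirrors the abstract's distinction between successfully translated verbs and the rest.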
Anthology ID:
2025.americasnlp-1.3
Volume:
Proceedings of the Fifth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Manuel Mager, Abteen Ebrahimi, Robert Pugh, Shruti Rijhwani, Katharina Von Der Wense, Luis Chiruzzo, Rolando Coto-Solano, Arturo Oncevay
Venues:
AmericasNLP | WS
Publisher:
Association for Computational Linguistics
Pages:
18–26
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.americasnlp-1.3/
Cite (ACL):
Minh Nguyen, Christopher Hammerly, and Miikka Silfverberg. 2025. A hybrid Approach to low-resource machine translation for Ojibwe verbs. In Proceedings of the Fifth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP), pages 18–26, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
A hybrid Approach to low-resource machine translation for Ojibwe verbs (Nguyen et al., AmericasNLP 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.americasnlp-1.3.pdf