@inproceedings{nguyen-etal-2025-hybrid,
    title = "A hybrid Approach to low-resource machine translation for {O}jibwe verbs",
    author = "Nguyen, Minh  and
      Hammerly, Christopher  and
      Slifverberg, Miikka",
    editor = "Mager, Manuel  and
      Ebrahimi, Abteen  and
      Pugh, Robert  and
      Rijhwani, Shruti  and
      Von Der Wense, Katharina  and
      Chiruzzo, Luis  and
      Coto-Solano, Rolando  and
      Oncevay, Arturo",
    booktitle = "Proceedings of the Fifth Workshop on NLP for Indigenous Languages of the Americas (AmericasNLP)",
    month = may,
    year = "2025",
    address = "Albuquerque, New Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.americasnlp-1.3/",
    doi = "10.18653/v1/2025.americasnlp-1.3",
    pages = "18--26",
    ISBN = "979-8-89176-236-7",
    abstract = "Machine translation is a tool that can help teachers, learners, and users of low-resourced languages. However, there are significant challenges in developing these tools, such as the lack of large-scale parallel corpora and complex morphology. We propose a novel hybrid system that combines LLM and rule-based methods in two distinct stages to translate inflected Ojibwe verbs into English. We use an LLM to automatically annotate dictionary data to build translation templates. Then, our rulebased module performs translation using inflection and slot-filling processes built on top of an FST-based analyzer. We test the system with a set of automated tests. Thanks to the ahead-of-time nature of the template-building process and the light-weight rule-based translation module, the end-to-end translation process has an average translation speed of 70 milliseconds per word. The system achieved an average ChrF score of 0.82 and a semantic similarity score of 0.93 among the successfully translated verbs in a test set. The approach has the potential to be extended to other low-resource Indigenous languages with dictionary data."
}Markdown (Informal)
[A hybrid Approach to low-resource machine translation for Ojibwe verbs](https://preview.aclanthology.org/ingest-emnlp/2025.americasnlp-1.3/) (Nguyen et al., AmericasNLP 2025)
ACL