Viability of Machine Translation for Healthcare in Low-Resourced Languages

Hellina Hailu Nigatu, Nikita Mehandru, Negasi Haile Abadi, Blen Gebremeskel, Ahmed Alaa, Monojit Choudhury


Abstract
Machine Translation errors in high-stakes settings like healthcare pose unique risks that could lead to clinical harm. The challenges are even more pronounced for low-resourced languages where human translators are scarce and MT tools perform poorly. In this work, we provide a taxonomy of Machine Translation errors for the healthcare domain using a publicly available MT system. Preparing an evaluation dataset from pre-existing medical datasets, we conduct our study focusing on two low-resourced languages: Amharic and Tigrinya. Based on our error analysis and findings from prior work, we test two pre-translation interventions–namely, paraphrasing the source sentence and pivoting with a related language– for their effectiveness in reducing clinical risk. We find that MT errors for healthcare most commonly happen when the source sentence includes medical terminology and procedure descriptions, synonyms, figurative language, and word order differences. We find that pre-translation interventions are not effective in reducing clinical risk if the base translation model performs poorly. Based on our findings, we provide recommendations for improving MT for healthcare.
Anthology ID:
2025.emnlp-main.535
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10595–10609
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.535/
DOI:
Bibkey:
Cite (ACL):
Hellina Hailu Nigatu, Nikita Mehandru, Negasi Haile Abadi, Blen Gebremeskel, Ahmed Alaa, and Monojit Choudhury. 2025. Viability of Machine Translation for Healthcare in Low-Resourced Languages. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 10595–10609, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Viability of Machine Translation for Healthcare in Low-Resourced Languages (Nigatu et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.535.pdf
Checklist:
 2025.emnlp-main.535.checklist.pdf