An Attention-Based Neural Translation System for English to Bodo

Subhash Wary, Birhang Borgoyary, Akher Ahmed, Mohanji Sah, Apurbalal Senapati


Abstract
Bodo is a resource-scarce indigenous language belonging to the Tibeto-Burman family. It is mainly spoken in the north-east region of India, where it has both linguistic and cultural significance. Only a limited number of resources and tools are available for this language. This paper presents a study of neural machine translation for the English-Bodo language pair. The system is developed on a relatively small parallel corpus provided by the Low-Resource Indic Language Translation shared task as part of WMT-2025. The system is evaluated by the WMT-2025 organizers using evaluation metrics such as BLEU, METEOR, ROUGE-L, chrF and TER. The result is not encouraging, but it provides a foundation for further improvement.
Anthology ID:
2025.wmt-1.96
Volume:
Proceedings of the Tenth Conference on Machine Translation
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
Publisher:
Association for Computational Linguistics
Pages:
1210–1214
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.96/
Cite (ACL):
Subhash Wary, Birhang Borgoyary, Akher Ahmed, Mohanji Sah, and Apurbalal Senapati. 2025. An Attention-Based Neural Translation System for English to Bodo. In Proceedings of the Tenth Conference on Machine Translation, pages 1210–1214, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
An Attention-Based Neural Translation System for English to Bodo (Wary et al., WMT 2025)
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.wmt-1.96.pdf