Findings of WMT 2024 Shared Task on Low-Resource Indic Languages Translation
Partha Pakray, Santanu Pal, Advaitha Vetagiri, Reddi Krishna, Arnab Kumar Maji, Sandeep Dash, Lenin Laitonjam, Lyngdoh Sarah, Riyanka Manna
Abstract
This paper presents the results of the low-resource Indic language translation task, organized in conjunction with the Ninth Conference on Machine Translation (WMT) 2024. In this edition, participants were challenged to develop machine translation models for four distinct language pairs: English–Assamese, English–Mizo, English–Khasi, and English–Manipuri. The task utilized the enriched IndicNE-Corp1.0 dataset, which includes an extensive collection of parallel and monolingual corpora for northeastern Indic languages. The evaluation was conducted through a comprehensive suite of automatic metrics (BLEU, TER, RIBES, METEOR, and ChrF), supplemented by meticulous human assessment to measure the translation systems’ performance and accuracy. This initiative aims to drive advancements in low-resource machine translation and make a substantial contribution to the growing body of knowledge in this dynamic field.
- Anthology ID:
- 2024.wmt-1.54
- Volume:
- Proceedings of the Ninth Conference on Machine Translation
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
- Venues:
- WMT | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 654–668
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.wmt-1.54/
- DOI:
- 10.18653/v1/2024.wmt-1.54
- Cite (ACL):
- Partha Pakray, Santanu Pal, Advaitha Vetagiri, Reddi Krishna, Arnab Kumar Maji, Sandeep Dash, Lenin Laitonjam, Lyngdoh Sarah, and Riyanka Manna. 2024. Findings of WMT 2024 Shared Task on Low-Resource Indic Languages Translation. In Proceedings of the Ninth Conference on Machine Translation, pages 654–668, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- Findings of WMT 2024 Shared Task on Low-Resource Indic Languages Translation (Pakray et al., WMT 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.wmt-1.54.pdf