Language Resource Building and English-to-Mizo Neural Machine Translation Encountering Tonal Words

Vanlalmuansangi Khenglawt, Sahinur Rahman Laskar, Santanu Pal, Partha Pakray, Ajoy Kumar Khan


Abstract
A multilingual country like India has enormous linguistic diversity and a growing demand for language resources that can support natural language processing applications such as machine translation. Low-resource language translation poses challenges for machine translation, including limited corpus availability and differences in linguistic information between languages. This paper investigates a low-resource language pair, English-to-Mizo, exploring neural machine translation and contributing an Indian language resource, i.e., an English-Mizo corpus. In this work, we address one of the main challenges, handling the tonal words of the Mizo language, which add complexity on top of the low-resource challenges for any natural language processing task. Our approach improves translation accuracy by handling Mizo tonal words and achieves a state-of-the-art result in English-to-Mizo translation.
Anthology ID:
2022.wildre-1.9
Volume:
Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Girish Nath Jha, Sobha L., Kalika Bali, Atul Kr. Ojha
Venue:
WILDRE
Publisher:
European Language Resources Association
Pages:
48–54
URL:
https://aclanthology.org/2022.wildre-1.9
Cite (ACL):
Vanlalmuansangi Khenglawt, Sahinur Rahman Laskar, Santanu Pal, Partha Pakray, and Ajoy Kumar Khan. 2022. Language Resource Building and English-to-Mizo Neural Machine Translation Encountering Tonal Words. In Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, pages 48–54, Marseille, France. European Language Resources Association.
Cite (Informal):
Language Resource Building and English-to-Mizo Neural Machine Translation Encountering Tonal Words (Khenglawt et al., WILDRE 2022)
PDF:
https://preview.aclanthology.org/naacl-24-ws-corrections/2022.wildre-1.9.pdf