English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation
Sahinur Rahman Laskar, Pankaj Dadure, Riyanka Manna, Partha Pakray, Sivaji Bandyopadhyay
Abstract
Automatic translation of one natural language to another is a popular task of natural language processing. Although the deep learning-based technique known as neural machine translation (NMT) is a widely accepted machine translation approach, it needs an adequate amount of training data, which is a challenging issue for low-resource pair translation. Moreover, the multimodal concept utilizes text and visual features to improve low-resource pair translation. WAT2022 (Workshop on Asian Translation 2022) organizes (hosted by the COLING 2022) English to Bengali multimodal translation task where we have participated as a team named CNLP-NITS-PP in two tracks: 1) text-only and 2) multimodal translation. Herein, we have proposed a transliteration-based phrase pairs augmentation approach which shows improvement in the multimodal translation task and achieved benchmark results on Bengali Visual Genome 1.0 dataset. We have attained the best results on the challenge and evaluation test set for English to Bengali multimodal translation with BLEU scores of 28.70, 43.90 and RIBES scores of 0.688931, 0.780669, respectively.- Anthology ID:
- 2022.wat-1.14
- Volume:
- Proceedings of the 9th Workshop on Asian Translation
- Month:
- October
- Year:
- 2022
- Address:
- Gyeongju, Republic of Korea
- Venue:
- WAT
- SIG:
- Publisher:
- International Conference on Computational Linguistics
- Note:
- Pages:
- 111–116
- Language:
- URL:
- https://aclanthology.org/2022.wat-1.14
- DOI:
- Cite (ACL):
- Sahinur Rahman Laskar, Pankaj Dadure, Riyanka Manna, Partha Pakray, and Sivaji Bandyopadhyay. 2022. English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation. In Proceedings of the 9th Workshop on Asian Translation, pages 111–116, Gyeongju, Republic of Korea. International Conference on Computational Linguistics.
- Cite (Informal):
- English to Bengali Multimodal Neural Machine Translation using Transliteration-based Phrase Pairs Augmentation (Laskar et al., WAT 2022)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2022.wat-1.14.pdf