Abstract
This paper describes the participation of team oneNLP (LTRC, IIIT-Hyderabad) for the WMT 2021 task, similar language translation. We experimented with transformer based Neural Machine Translation and explored the use of language similarity for Tamil-Telugu and Telugu-Tamil. We incorporated use of different subword configurations, script conversion and single model training for both directions as exploratory experiments.- Anthology ID:
- 2021.wmt-1.30
- Volume:
- Proceedings of the Sixth Conference on Machine Translation
- Month:
- November
- Year:
- 2021
- Address:
- Online
- Venue:
- WMT
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 288–291
- Language:
- URL:
- https://aclanthology.org/2021.wmt-1.30
- DOI:
- Cite (ACL):
- Vandan Mujadia and Dipti Sharma. 2021. Low Resource Similar Language Neural Machine Translation for Tamil-Telugu. In Proceedings of the Sixth Conference on Machine Translation, pages 288–291, Online. Association for Computational Linguistics.
- Cite (Informal):
- Low Resource Similar Language Neural Machine Translation for Tamil-Telugu (Mujadia & Sharma, WMT 2021)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2021.wmt-1.30.pdf