Abstract
We describe the University of Edinburgh’s Bengali↔Hindi constrained systems submitted to the WMT21 News Translation task. We submitted ensembles of Transformer models built with large-scale back-translation and fine-tuned on subsets of training data retrieved based on similarity to the target domain.- Anthology ID:
- 2021.wmt-1.16
- Volume:
- Proceedings of the Sixth Conference on Machine Translation
- Month:
- November
- Year:
- 2021
- Address:
- Online
- Editors:
- Loic Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussa, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Tom Kocmi, Andre Martins, Makoto Morishita, Christof Monz
- Venue:
- WMT
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 180–186
- Language:
- URL:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2021.wmt-1.16/
- DOI:
- Cite (ACL):
- Proyag Pal, Alham Fikri Aji, Pinzhen Chen, and Sukanta Sen. 2021. The University of Edinburgh’s Bengali-Hindi Submissions to the WMT21 News Translation Task. In Proceedings of the Sixth Conference on Machine Translation, pages 180–186, Online. Association for Computational Linguistics.
- Cite (Informal):
- The University of Edinburgh’s Bengali-Hindi Submissions to the WMT21 News Translation Task (Pal et al., WMT 2021)
- PDF:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2021.wmt-1.16.pdf
- Data
- CCAligned