Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation

Sweta Agrawal; Julia Kreutzer; Colin Cherry

Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation

Sweta Agrawal, Julia Kreutzer, Colin Cherry

Abstract

Non-autoregressive (NAR) machine translation has recently received significant developments and now achieves comparable quality with autoregressive (AR) models on some benchmarks while providing an efficient alternative to AR inference. However, while AR translation is often used to implement multilingual models that benefit from transfer between languages and from improved serving efficiency, multilingual NAR models remain relatively unexplored. Taking Connectionist Temporal Classification as an example NAR model and IMPUTER as a semi-NAR model, we present a comprehensive empirical study of multilingual NAR. We test its capabilities with respect to positive transfer between related languages and negative transfer under capacity constraints. As NAR models require distilled training sets, we carefully study the impact of bilingual versus multilingual teachers. Finally, we fit a scaling law for multilingual NAR to determine capacity bottlenecks, which quantifies its performance relative to the AR model as the model scale increases.

Anthology ID:: 2022.wmt-1.11
Volume:: Proceedings of the Seventh Conference on Machine Translation (WMT)
Month:: December
Year:: 2022
Address:: Abu Dhabi, United Arab Emirates (Hybrid)
Editors:: Philipp Koehn, Loïc Barrault, Ondřej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 177–187
Language:
URL:: https://preview.aclanthology.org/jlcl-multiple-ingestion/2022.wmt-1.11/
DOI:
Bibkey:
Cite (ACL):: Sweta Agrawal, Julia Kreutzer, and Colin Cherry. 2022. Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation. In Proceedings of the Seventh Conference on Machine Translation (WMT), pages 177–187, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):: Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation (Agrawal et al., WMT 2022)
Copy Citation:
PDF:: https://preview.aclanthology.org/jlcl-multiple-ingestion/2022.wmt-1.11.pdf
Video:: https://preview.aclanthology.org/jlcl-multiple-ingestion/2022.wmt-1.11.mp4

PDF Cite Search Video Fix data