Re-translation versus Streaming for Simultaneous Translation

Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, George Foster


Abstract
There has been great progress in improving streaming machine translation, a simultaneous paradigm where the system appends to a growing hypothesis as more source content becomes available. We study a related problem in which revisions to the hypothesis beyond strictly appending words are permitted. This is suitable for applications such as live captioning an audio feed. In this setting, we compare custom streaming approaches to re-translation, a straightforward strategy where each new source token triggers a distinct translation from scratch. We find re-translation to be as good or better than state-of-the-art streaming systems, even when operating under constraints that allow very few revisions. We attribute much of this success to a previously proposed data-augmentation technique that adds prefix-pairs to the training data, which alongside wait-k inference forms a strong baseline for streaming translation. We also highlight re-translation’s ability to wrap arbitrarily powerful MT systems with an experiment showing large improvements from an upgrade to its base model.
Anthology ID:
2020.iwslt-1.27
Volume:
Proceedings of the 17th International Conference on Spoken Language Translation
Month:
July
Year:
2020
Address:
Online
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Association for Computational Linguistics
Note:
Pages:
220–227
Language:
URL:
https://aclanthology.org/2020.iwslt-1.27
DOI:
10.18653/v1/2020.iwslt-1.27
Bibkey:
Cite (ACL):
Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, and George Foster. 2020. Re-translation versus Streaming for Simultaneous Translation. In Proceedings of the 17th International Conference on Spoken Language Translation, pages 220–227, Online. Association for Computational Linguistics.
Cite (Informal):
Re-translation versus Streaming for Simultaneous Translation (Arivazhagan et al., IWSLT 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.iwslt-1.27.pdf
Video:
 http://slideslive.com/38929612