Abstract
Neural Machine Translation (NMT) generates target words sequentially by predicting the next word conditioned on the context words. At training time, it predicts with the ground-truth words as context, while at inference it has to generate the entire sequence from scratch. This discrepancy in the fed context leads to error accumulation along the way. Furthermore, word-level training requires strict matching between the generated sequence and the ground-truth sequence, which leads to overcorrection of different but reasonable translations. In this paper, we address these issues by sampling context words during training not only from the ground-truth sequence but also from the sequence predicted by the model, where the predicted sequence is selected with a sentence-level optimum. Experimental results on Chinese→English and WMT’14 English→German translation tasks demonstrate that our approach achieves significant improvements on multiple datasets.
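The abstract describes a training scheme that mixes ground-truth tokens with the model's own predictions as decoder context, with the predicted sequence chosen by a sentence-level criterion. The sketch below is a minimal illustration of that idea, not the authors' released code: the function names (`truth_prob`, `select_oracle`, `sample_context`), the inverse-sigmoid decay schedule, and the use of sentence-level BLEU as the selection criterion are assumptions about one plausible instantiation of what the abstract states.

```python
# A minimal sketch (NOT the authors' implementation) of sampling decoder
# context from either the ground truth or a model-predicted oracle sequence.
import math
import random

from nltk.translate.bleu_score import sentence_bleu


def truth_prob(epoch, mu=12.0):
    """Probability of feeding the ground-truth token at a given epoch.

    Assumed inverse-sigmoid decay: near 1 early in training (mostly
    teacher forcing), falling toward 0 later (mostly model context).
    """
    return mu / (mu + math.exp(epoch / mu))


def select_oracle(candidates, reference):
    """Pick the candidate with the best sentence-level BLEU, so the
    predicted context reflects a sentence-level optimum rather than a
    greedy word-level choice. Candidates and reference are token lists."""
    return max(candidates, key=lambda cand: sentence_bleu([reference], cand))


def sample_context(gold_tokens, oracle_tokens, epoch):
    """Build the decoder input by mixing ground-truth and oracle tokens."""
    p = truth_prob(epoch)
    return [g if random.random() < p else o
            for g, o in zip(gold_tokens, oracle_tokens)]


# Hypothetical usage: at epoch 20, mix the reference with a BLEU-selected
# candidate. In practice the candidates would come from beam search over
# the model's own predictions; plain token lists stand in for them here.
reference = "we must bridge the gap between training and inference".split()
beam = [
    "we should bridge the gap between training and inference".split(),
    "we must close the distance of training and inference".split(),
]
oracle = select_oracle(beam, reference)
mixed_context = sample_context(reference, oracle, epoch=20)
```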
- Anthology ID: P19-1426
- Volume: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
- Month: July
- Year: 2019
- Address: Florence, Italy
- Editors: Anna Korhonen, David Traum, Lluís Màrquez
- Venue: ACL
- Publisher: Association for Computational Linguistics
- Pages: 4334–4343
- URL: https://preview.aclanthology.org/build-pipeline-with-new-library/P19-1426/
- DOI: 10.18653/v1/P19-1426
- Award: Best Long Paper
- Cite (ACL): Wen Zhang, Yang Feng, Fandong Meng, Di You, and Qun Liu. 2019. Bridging the Gap between Training and Inference for Neural Machine Translation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4334–4343, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal): Bridging the Gap between Training and Inference for Neural Machine Translation (Zhang et al., ACL 2019)
- PDF: https://preview.aclanthology.org/build-pipeline-with-new-library/P19-1426.pdf