@inproceedings{zhang-etal-2020-long,
    title = "Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation",
    author = "Zhang, Pei  and
      Chen, Boxing  and
      Ge, Niyu  and
      Fan, Kai",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2020.emnlp-main.81",
    doi = "10.18653/v1/2020.emnlp-main.81",
    pages = "1081--1087",
    abstract = "Many document-level neural machine translation (NMT) systems have explored the utility of context-aware architectures, usually at the cost of a growing number of parameters and higher computational complexity. However, little attention has been paid to the baseline model. In this paper, we extensively investigate the pros and cons of the standard transformer in document-level translation, and find that its auto-regressive property can simultaneously bring the advantage of consistency and the disadvantage of error accumulation. We therefore propose a surprisingly simple long-short term masking self-attention on top of the standard transformer, which both captures long-range dependencies effectively and reduces error propagation. We evaluate our approach on two publicly available document-level datasets, achieving strong BLEU results and capturing discourse phenomena.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="zhang-etal-2020-long">
    <titleInfo>
        <title>Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Pei</namePart>
        <namePart type="family">Zhang</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Boxing</namePart>
        <namePart type="family">Chen</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Niyu</namePart>
        <namePart type="family">Ge</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Kai</namePart>
        <namePart type="family">Fan</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2020-11</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)</title>
        </titleInfo>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Many document-level neural machine translation (NMT) systems have explored the utility of context-aware architectures, usually at the cost of a growing number of parameters and higher computational complexity. However, little attention has been paid to the baseline model. In this paper, we extensively investigate the pros and cons of the standard transformer in document-level translation, and find that its auto-regressive property can simultaneously bring the advantage of consistency and the disadvantage of error accumulation. We therefore propose a surprisingly simple long-short term masking self-attention on top of the standard transformer, which both captures long-range dependencies effectively and reduces error propagation. We evaluate our approach on two publicly available document-level datasets, achieving strong BLEU results and capturing discourse phenomena.</abstract>
    <identifier type="citekey">zhang-etal-2020-long</identifier>
    <identifier type="doi">10.18653/v1/2020.emnlp-main.81</identifier>
    <location>
        <url>https://aclanthology.org/2020.emnlp-main.81</url>
    </location>
    <part>
        <date>2020-11</date>
        <extent unit="page">
            <start>1081</start>
            <end>1087</end>
        </extent>
    </part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation
%A Zhang, Pei
%A Chen, Boxing
%A Ge, Niyu
%A Fan, Kai
%S Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
%D 2020
%8 November
%I Association for Computational Linguistics
%C Online
%F zhang-etal-2020-long
%X Many document-level neural machine translation (NMT) systems have explored the utility of context-aware architectures, usually at the cost of a growing number of parameters and higher computational complexity. However, little attention has been paid to the baseline model. In this paper, we extensively investigate the pros and cons of the standard transformer in document-level translation, and find that its auto-regressive property can simultaneously bring the advantage of consistency and the disadvantage of error accumulation. We therefore propose a surprisingly simple long-short term masking self-attention on top of the standard transformer, which both captures long-range dependencies effectively and reduces error propagation. We evaluate our approach on two publicly available document-level datasets, achieving strong BLEU results and capturing discourse phenomena.
%R 10.18653/v1/2020.emnlp-main.81
%U https://aclanthology.org/2020.emnlp-main.81
%U https://doi.org/10.18653/v1/2020.emnlp-main.81
%P 1081-1087
Markdown (Informal)
[Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation](https://aclanthology.org/2020.emnlp-main.81) (Zhang et al., EMNLP 2020)
ACL
Pei Zhang, Boxing Chen, Niyu Ge, and Kai Fan. 2020. Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1081–1087, Online. Association for Computational Linguistics.
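
The abstract above sketches the paper's core idea: keeping the standard transformer intact and changing only the self-attention mask, so that part of the attention is confined to the current sentence (short-term) while the rest ranges over the surrounding document (long-term). As a rough illustration only, here is a minimal NumPy sketch of one plausible reading of that idea; the sentence-id input, the head split, and the causal option are assumptions for illustration, not the paper's exact configuration.

```python
# Minimal sketch of long-short term attention masking (NumPy).
# Assumption: "short-term" heads attend only within the current sentence,
# while "long-term" heads attend across the whole document window. The
# paper's exact masking scheme may differ; see the publication for details.
import numpy as np

def long_short_masks(sent_ids: np.ndarray, causal: bool = True):
    """Build boolean attention masks (True = position may be attended).

    sent_ids: shape (T,), the sentence index of each token in the document.
    Returns (short_mask, long_mask), each of shape (T, T).
    """
    T = sent_ids.shape[0]
    short = sent_ids[:, None] == sent_ids[None, :]   # same-sentence pairs
    long_ = np.ones((T, T), dtype=bool)              # full document window
    if causal:  # decoder side: never attend to future tokens
        tri = np.tril(np.ones((T, T), dtype=bool))
        short, long_ = short & tri, long_ & tri
    return short, long_

def masked_attention(q, k, v, mask):
    """Scaled dot-product attention restricted by a boolean mask."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = np.where(mask, scores, -1e9)            # block disallowed pairs
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

# Example: a 6-token "document" made of two 3-token sentences.
sent_ids = np.array([0, 0, 0, 1, 1, 1])
short, long_ = long_short_masks(sent_ids)
q = k = v = np.random.randn(6, 8)
out_short = masked_attention(q, k, v, short)  # intra-sentence heads
out_long = masked_attention(q, k, v, long_)   # document-wide heads
```

Per the abstract, the appeal of this split is that document-wide attention preserves consistency while sentence-local attention limits how far decoding errors can propagate, and since only the mask changes, no parameters are added to the standard transformer.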