Accurate Word Alignment Induction from Neural Machine Translation

Yun Chen; Yang Liu; Guanhua Chen; Xin Jiang; Qun Liu

doi:10.18653/v1/2020.emnlp-main.42

Accurate Word Alignment Induction from Neural Machine Translation

Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, Qun Liu

Abstract

Despite its original goal to jointly learn to align and translate, prior researches suggest that Transformer captures poor word alignments through its attention mechanism. In this paper, we show that attention weights do capture accurate word alignments and propose two novel word alignment induction methods Shift-Att and Shift-AET. The main idea is to induce alignments at the step when the to-be-aligned target token is the decoder input rather than the decoder output as in previous work. Shift-Att is an interpretation method that induces alignments from the attention weights of Transformer and does not require parameter update or architecture change. Shift-AET extracts alignments from an additional alignment module which is tightly integrated into Transformer and trained in isolation with supervision from symmetrized Shift-Att alignments. Experiments on three publicly available datasets demonstrate that both methods perform better than their corresponding neural baselines and Shift-AET significantly outperforms GIZA++ by 1.4-4.8 AER points.

Anthology ID:: 2020.emnlp-main.42
Volume:: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Month:: November
Year:: 2020
Address:: Online
Editors:: Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 566–576
Language:
URL:: https://aclanthology.org/2020.emnlp-main.42
DOI:: 10.18653/v1/2020.emnlp-main.42
Bibkey:
Cite (ACL):: Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, and Qun Liu. 2020. Accurate Word Alignment Induction from Neural Machine Translation. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 566–576, Online. Association for Computational Linguistics.
Cite (Informal):: Accurate Word Alignment Induction from Neural Machine Translation (Chen et al., EMNLP 2020)
Copy Citation:
PDF:: https://preview.aclanthology.org/dois-2013-emnlp/2020.emnlp-main.42.pdf
Video:: https://slideslive.com/38938969
Code: sufe-nlp/transformer-alignment

PDF Search Code Video