Abstract
Evaluation discrepancy and overcorrection phenomenon are two common problems in neural machine translation (NMT). NMT models are generally trained with word-level learning objective, but evaluated by sentence-level metrics. Moreover, the cross-entropy loss function discourages model to generate synonymous predictions and overcorrect them to ground truth words. To address these two drawbacks, we adopt multi-task learning and propose a mixed learning objective (MLO) which combines the strength of word-level and sentence-level evaluation without modifying model structure. At word-level, it calculates semantic similarity between predicted and ground truth words. At sentence-level, it computes probabilistic n-gram matching scores of generated translations. We also combine a loss-sensitive scheduled sampling decoding strategy with MLO to explore its extensibility. Experimental results on IWSLT 2016 German-English and WMT 2019 English-Chinese datasets demonstrate that our methodology can significantly promote translation quality. The ablation study shows that both word-level and sentence-level learning objectives can improve BLEU scores. Furthermore, MLO is consistent with state-of-the-art scheduled sampling methods and can achieve further promotion.- Anthology ID:
- 2020.ccl-1.90
- Volume:
- Proceedings of the 19th Chinese National Conference on Computational Linguistics
- Month:
- October
- Year:
- 2020
- Address:
- Haikou, China
- Editors:
- Maosong Sun (孙茂松), Sujian Li (李素建), Yue Zhang (张岳), Yang Liu (刘洋)
- Venue:
- CCL
- SIG:
- Publisher:
- Chinese Information Processing Society of China
- Note:
- Pages:
- 974–983
- Language:
- English
- URL:
- https://aclanthology.org/2020.ccl-1.90
- DOI:
- Cite (ACL):
- Wenjie Lu, Leiying Zhou, Gongshen Liu, and Quanhai Zhang. 2020. A Mixed Learning Objective for Neural Machine Translation. In Proceedings of the 19th Chinese National Conference on Computational Linguistics, pages 974–983, Haikou, China. Chinese Information Processing Society of China.
- Cite (Informal):
- A Mixed Learning Objective for Neural Machine Translation (Lu et al., CCL 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2020.ccl-1.90.pdf