PATQUEST: Papago Translation Quality Estimation

Yujin Baek; Zae Myung Kim; Jihyung Moon; Hyunjoong Kim; Eunjeong Park

Abstract

This paper describes the system submitted by the Papago team for the quality estimation task at WMT 2020. It proposes two key strategies for quality estimation: (1) a task-specific pretraining scheme, and (2) task-specific data augmentation. The former devises learning signals for pretraining that are closely related to the downstream task. The latter simulates the varying levels of errors that the downstream dataset may contain. As a result, our PATQUEST models are exposed to erroneous translations during both task-specific pretraining and finetuning, which enhances their generalization capability. Our submitted models achieve significant improvements over the baselines for Task 1 (Sentence-Level Direct Assessment; EN-DE only) and Task 3 (Document-Level Score).

Anthology ID: 2020.wmt-1.113
Volume: Proceedings of the Fifth Conference on Machine Translation
Month: November
Year: 2020
Address: Online
Venues: EMNLP | WMT
SIG: SIGMT
Publisher: Association for Computational Linguistics
Pages: 991–998
URL: https://aclanthology.org/2020.wmt-1.113
Cite (ACL): Yujin Baek, Zae Myung Kim, Jihyung Moon, Hyunjoong Kim, and Eunjeong Park. 2020. PATQUEST: Papago Translation Quality Estimation. In Proceedings of the Fifth Conference on Machine Translation, pages 991–998, Online. Association for Computational Linguistics.
Cite (Informal): PATQUEST: Papago Translation Quality Estimation (Baek et al., WMT 2020)
PDF: https://preview.aclanthology.org/update-css-js/2020.wmt-1.113.pdf
Video: https://slideslive.com/38939610
