Extended Study on Using Pretrained Language Models and YiSi-1 for Machine Translation Evaluation

Chi-Kiu Lo

Extended Study on Using Pretrained Language Models and YiSi-1 for Machine Translation Evaluation

Abstract

We present an extended study on using pretrained language models and YiSi-1 for machine translation evaluation. Although the recently proposed contextual embedding based metrics, YiSi-1, significantly outperform BLEU and other metrics in correlating with human judgment on translation quality, we have yet to understand the full strength of using pretrained language models for machine translation evaluation. In this paper, we study YiSi-1’s correlation with human translation quality judgment by varying three major attributes (which architecture; which inter- mediate layer; whether it is monolingual or multilingual) of the pretrained language mod- els. Results of the study show further improvements over YiSi-1 on the WMT 2019 Metrics shared task. We also describe the pretrained language model we trained for evaluating Inuktitut machine translation output.

Anthology ID:: 2020.wmt-1.99
Volume:: Proceedings of the Fifth Conference on Machine Translation
Month:: November
Year:: 2020
Address:: Online
Editors:: Loïc Barrault, Ondřej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 895–902
Language:
URL:: https://preview.aclanthology.org/jlcl-multiple-ingestion/2020.wmt-1.99/
DOI:
Bibkey:
Cite (ACL):: Chi-kiu Lo. 2020. Extended Study on Using Pretrained Language Models and YiSi-1 for Machine Translation Evaluation. In Proceedings of the Fifth Conference on Machine Translation, pages 895–902, Online. Association for Computational Linguistics.
Cite (Informal):: Extended Study on Using Pretrained Language Models and YiSi-1 for Machine Translation Evaluation (Lo, WMT 2020)
Copy Citation:
PDF:: https://preview.aclanthology.org/jlcl-multiple-ingestion/2020.wmt-1.99.pdf
Video:: https://slideslive.com/38939652

PDF Cite Search Video Fix data