@inproceedings{nakatani-etal-2022-comparing,
  title     = {Comparing {BERT}-based Reward Functions for Deep Reinforcement Learning in Machine Translation},
  author    = {Nakatani, Yuki and
               Kajiwara, Tomoyuki and
               Ninomiya, Takashi},
  booktitle = {Proceedings of the 9th Workshop on Asian Translation},
  month     = oct,
  year      = {2022},
  address   = {Gyeongju, Republic of Korea},
  publisher = {International Conference on Computational Linguistics},
  url       = {https://aclanthology.org/2022.wat-1.2},
  pages     = {37--43},
  abstract  = {In text generation tasks such as machine translation, models are generally trained using cross-entropy loss. However, mismatches between the loss function and the evaluation metric are often problematic. It is known that this problem can be addressed by direct optimization to the evaluation metric with reinforcement learning. In machine translation, previous studies have used BLEU to calculate rewards for reinforcement learning, but BLEU is not well correlated with human evaluation. In this study, we investigate the impact on machine translation quality through reinforcement learning based on evaluation metrics that are more highly correlated with human evaluation. Experimental results show that reinforcement learning with BERT-based rewards can improve various evaluation metrics.},
}
@comment{
  Markdown (Informal):
  [Comparing BERT-based Reward Functions for Deep Reinforcement Learning in Machine Translation](https://aclanthology.org/2022.wat-1.2) (Nakatani et al., WAT 2022)
  ACL
}