Simona Doneva
2020
Terminology-Constrained Neural Machine Translation at SAP
Miriam Exel
|
Bianka Buschbeck
|
Lauritz Brandt
|
Simona Doneva
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
This paper examines approaches to bias a neural machine translation model to adhere to terminology constraints in an industrial setup. In particular, we investigate variations of the approach by Dinu et al. (2019), which uses inline annotation of the target terms in the source segment plus source factor embeddings during training and inference, and compare them to constrained decoding. We describe the challenges with respect to terminology in our usage scenario at SAP and show how far the investigated methods can help to overcome them. We extend the original study to a new language pair and provide an in-depth evaluation including an error classification and a human evaluation.