Abstract
This paper describes Lingua Custodia’s submission to the WMT’19 news shared task for German-to-French on the topic of the EU elections. We report experiments on the adaptation of the terminology of a machine translation system to a specific topic, aimed at providing more accurate translations of specific entities like political parties and person names, given that the shared task provided no in-domain training parallel data dealing with the restricted topic. Our primary submission to the shared task uses backtranslation generated with a type of decoding allowing the insertion of constraints in the output in order to guarantee the correct translation of specific terms that are not necessarily observed in the data.
- Anthology ID:
- W19-5310
- Volume:
- Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- WMT
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Pages:
- 147–154
- URL:
- https://aclanthology.org/W19-5310
- DOI:
- 10.18653/v1/W19-5310
- Cite (ACL):
- Franck Burlot. 2019. Lingua Custodia at WMT’19: Attempts to Control Terminology. In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), pages 147–154, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Lingua Custodia at WMT’19: Attempts to Control Terminology (Burlot, WMT 2019)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/W19-5310.pdf