Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System
Abstract
We describe in this paper a hybrid ap-proach to build automatically bilingual lexicons of Multiword Expressions (MWEs) from parallel corpora. We more specifically investigate the impact of using a domain-specific bilingual lexicon of MWEs on domain adaptation of an Example-Based Machine Translation (EBMT) system. We conducted experiments on the English-French language pair and two kinds of texts: in-domain texts from Europarl (European Parliament proceedings) and out-of-domain texts from Emea (European Medicines Agency documents) and Ecb (European Central Bank corpus). The obtained results indicate that integrating domain-specific bilingual lexicons of MWEs improves translation quality of the EBMT system when texts to translate are related to the specific domain and induces a relatively slight deterioration of translation quality when translating general-purpose texts.- Anthology ID:
- R17-1085
- Volume:
- Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
- Month:
- September
- Year:
- 2017
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 661–670
- Language:
- URL:
- https://doi.org/10.26615/978-954-452-049-6_085
- DOI:
- 10.26615/978-954-452-049-6_085
- Cite (ACL):
- Nasredine Semmar and Mariama Laib. 2017. Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 661–670, Varna, Bulgaria. INCOMA Ltd..
- Cite (Informal):
- Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System (Semmar & Laib, RANLP 2017)
- PDF:
- https://doi.org/10.26615/978-954-452-049-6_085