Abstract
We describe in this paper a hybrid ap-proach to build automatically bilingual lexicons of Multiword Expressions (MWEs) from parallel corpora. We more specifically investigate the impact of using a domain-specific bilingual lexicon of MWEs on domain adaptation of an Example-Based Machine Translation (EBMT) system. We conducted experiments on the English-French language pair and two kinds of texts: in-domain texts from Europarl (European Parliament proceedings) and out-of-domain texts from Emea (European Medicines Agency documents) and Ecb (European Central Bank corpus). The obtained results indicate that integrating domain-specific bilingual lexicons of MWEs improves translation quality of the EBMT system when texts to translate are related to the specific domain and induces a relatively slight deterioration of translation quality when translating general-purpose texts.- Anthology ID:
 - R17-1085
 - Volume:
 - Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
 - Month:
 - September
 - Year:
 - 2017
 - Address:
 - Varna, Bulgaria
 - Venue:
 - RANLP
 - SIG:
 - Publisher:
 - INCOMA Ltd.
 - Note:
 - Pages:
 - 661–670
 - Language:
 - URL:
 - https://doi.org/10.26615/978-954-452-049-6_085
 - DOI:
 - 10.26615/978-954-452-049-6_085
 - Cite (ACL):
 - Nasredine Semmar and Mariama Laib. 2017. Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 661–670, Varna, Bulgaria. INCOMA Ltd..
 - Cite (Informal):
 - Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System (Semmar & Laib, RANLP 2017)
 - PDF:
 - https://doi.org/10.26615/978-954-452-049-6_085