Abstract
We propose a method for joint unsupervised discovery of multiword expressions (MWEs) and their translations from parallel corpora. First, we apply independent monolingual MWE extraction in source and target languages simultaneously. Then, we calculate translation probability, association score and distributional similarity of co-occurring pairs. Finally, we rank all translations of a given MWE using a linear combination of these features. Preliminary experiments on light verb constructions show promising results.
- Anthology ID:
- W17-1711
- Volume:
- Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Editors:
- Stella Markantonatou, Carlos Ramisch, Agata Savary, Veronika Vincze
- Venue:
- MWE
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Pages:
- 91–96
- URL:
- https://aclanthology.org/W17-1711
- DOI:
- 10.18653/v1/W17-1711
- Cite (ACL):
- Natalie Vargas, Carlos Ramisch, and Helena Caseli. 2017. Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 91–96, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment (Vargas et al., MWE 2017)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/W17-1711.pdf
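
The final step described in the abstract, ranking the candidate translations of an MWE by a linear combination of translation probability, association score and distributional similarity, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the `Candidate` class, the function names, the uniform default weights and the feature values are hypothetical and are not taken from the paper, which may normalise or weight the features differently.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A candidate translation of one source MWE (hypothetical structure)."""
    target: str
    translation_prob: float  # translation probability of the pair
    association: float       # association score of the pair
    similarity: float        # distributional similarity of the pair


def score(cand, weights=(1.0, 1.0, 1.0)):
    """Linear combination of the three features; the weights are assumed."""
    w_prob, w_assoc, w_sim = weights
    return (w_prob * cand.translation_prob
            + w_assoc * cand.association
            + w_sim * cand.similarity)


def rank_translations(candidates, weights=(1.0, 1.0, 1.0)):
    """Return the candidates sorted from highest to lowest combined score."""
    return sorted(candidates, key=lambda c: score(c, weights), reverse=True)


# Illustrative feature values only; these are not results from the paper.
candidates = [
    Candidate("target_a", 0.5, 0.25, 0.25),
    Candidate("target_b", 0.25, 0.25, 0.125),
    Candidate("target_c", 0.75, 0.5, 0.5),
]

ranked = rank_translations(candidates)
print([c.target for c in ranked])
```

Changing the `weights` tuple shifts the emphasis between the features; for example, `(0.0, 0.0, 1.0)` would rank by distributional similarity alone.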