Natalie Vargas
2017
Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment
Natalie Vargas
|
Carlos Ramisch
|
Helena Caseli
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
We propose a method for joint unsupervised discovery of multiword expressions (MWEs) and their translations from parallel corpora. First, we apply independent monolingual MWE extraction in source and target languages simultaneously. Then, we calculate translation probability, association score and distributional similarity of co-occurring pairs. Finally, we rank all translations of a given MWE using a linear combination of these features. Preliminary experiments on light verb constructions show promising results.