Contextualized Translations of Phrasal Verbs with Distributional Compositional Semantics and Monolingual Corpora

Pablo Gamallo, Susana Sotelo, José Ramom Pichel, Mikel Artetxe


Abstract
This article describes a compositional distributional method to generate contextualized senses of words and identify their appropriate translations in the target language using monolingual corpora. Word translation is modeled in the same way as contextualization of word meaning, but in a bilingual vector space. The contextualization of meaning is carried out by means of distributional composition within a structured vector space with syntactic dependencies, and the bilingual space is created by means of transfer rules and a bilingual dictionary. A phrase in the source language, consisting of a head and a dependent, is translated into the target language by selecting both the nearest neighbor of the head given the dependent, and the nearest neighbor of the dependent given the head. This process is expanded to larger phrases by means of incremental composition. Experiments were performed on English and Spanish monolingual corpora in order to translate phrasal verbs in context. A new bilingual data set to evaluate strategies aimed at translating phrasal verbs in restricted syntactic domains has been created and released.
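The selection step described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions not stated in the abstract: composition is modeled as a component-wise product, similarity as cosine, and the source and target vectors are taken to already live in one shared bilingual space (in the article this space is built with transfer rules and a bilingual dictionary). All vectors, feature names, candidate lists, and function names below are hypothetical toy values, not data or code from the article.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def contextualize(word_vec, partner_vec):
    """Restrict a word's vector by its syntactic partner.

    Assumption: composition is a component-wise product over shared features.
    """
    return {k: word_vec[k] * partner_vec[k] for k in word_vec if k in partner_vec}


def nearest_translation(word_vec, partner_vec, candidates):
    """Pick the candidate translation closest to the contextualized word."""
    ctx = contextualize(word_vec, partner_vec)
    return max(candidates, key=lambda c: cosine(ctx, candidates[c]))


def translate_phrase(head_vec, dep_vec, head_cands, dep_cands):
    """Translate a head-dependent pair: head given dependent, dependent given head."""
    head_t = nearest_translation(head_vec, dep_vec, head_cands)
    dep_t = nearest_translation(dep_vec, head_vec, dep_cands)
    return head_t, dep_t


# Hypothetical toy vectors, assumed to be already mapped into a shared space.
take_off = {"air": 0.6, "clothes": 0.7, "speed": 0.2}
plane = {"air": 0.9, "speed": 0.5, "geometry": 0.6}
coat = {"clothes": 0.9, "cold": 0.4}

# Hypothetical Spanish candidates, as a bilingual dictionary might propose them.
take_off_cands = {
    "despegar": {"air": 0.8, "speed": 0.4},
    "quitar": {"clothes": 0.9, "air": 0.1},
}
plane_cands = {
    "avión": {"air": 0.9, "speed": 0.4},
    "plano": {"geometry": 0.9, "air": 0.1},
}
coat_cands = {
    "abrigo": {"clothes": 0.8, "cold": 0.5},
    "capa": {"paint": 0.9, "clothes": 0.2},
}

print(translate_phrase(take_off, plane, take_off_cands, plane_cands))
print(translate_phrase(take_off, coat, take_off_cands, coat_cands))
```

In this toy setting the same phrasal verb receives a different translation depending on its dependent, which is the behavior the abstract describes. Extending the idea to larger phrases would mean reusing an already contextualized vector as input to the next composition step.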
Anthology ID:
J19-3001
Volume:
Computational Linguistics, Volume 45, Issue 3 - September 2019
Month:
September
Year:
2019
Address:
Cambridge, MA
Venue:
CL
Publisher:
MIT Press
Pages:
395–421
URL:
https://aclanthology.org/J19-3001
DOI:
10.1162/coli_a_00353
Cite (ACL):
Pablo Gamallo, Susana Sotelo, José Ramom Pichel, and Mikel Artetxe. 2019. Contextualized Translations of Phrasal Verbs with Distributional Compositional Semantics and Monolingual Corpora. Computational Linguistics, 45(3):395–421.
Cite (Informal):
Contextualized Translations of Phrasal Verbs with Distributional Compositional Semantics and Monolingual Corpora (Gamallo et al., CL 2019)
PDF:
https://preview.aclanthology.org/ingestion-script-update/J19-3001.pdf