Phrase-based statistical machine translation with pivot languages.

Nicola Bertoldi, Madalina Barbaiani, Marcello Federico, Roldano Cattoni


Abstract
Translation with pivot languages has recently gained attention as a means to circumvent the data bottleneck of statistical machine translation (SMT). This paper aims to give a mathematically sound formulation of the various approaches presented in the literature and introduces new methods for training alignment models through pivot languages. We present experimental results on Chinese-Spanish translation via English on a popular travel-domain task. In contrast to previous literature, we report experimental results using parallel corpora that are either disjoint or overlapping on the pivot language side. Finally, our original method for generating training data through random sampling is shown to perform as well as the best methods based on the coupling of translation systems.
Anthology ID:
2008.iwslt-papers.1
Volume:
Proceedings of the 5th International Workshop on Spoken Language Translation: Papers
Month:
October 20-21
Year:
2008
Address:
Waikiki, Hawaii
Venue:
IWSLT
SIG:
SIGSLT
Pages:
143–149
URL:
https://aclanthology.org/2008.iwslt-papers.1
Cite (ACL):
Nicola Bertoldi, Madalina Barbaiani, Marcello Federico, and Roldano Cattoni. 2008. Phrase-based statistical machine translation with pivot languages. In Proceedings of the 5th International Workshop on Spoken Language Translation: Papers, pages 143–149, Waikiki, Hawaii.
Cite (Informal):
Phrase-based statistical machine translation with pivot languages. (Bertoldi et al., IWSLT 2008)
PDF:
https://preview.aclanthology.org/ingestion-script-update/2008.iwslt-papers.1.pdf