Morpho-syntactic analysis for reordering in statistical machine translation

Sonja Niessen, Hermann Ney


Abstract
In the framework of statistical machine translation (SMT), correspondences between the words in the source and the target language are learned from bilingual corpora on the basis of so-called alignment models. Among other things these are meant to capture the differences in word order in different languages. In this paper we show that SMT can take advantage of the explicit introduction of some linguistic knowledge about the sentence structure in the languages under consideration. In contrast to previous publications dealing with the incorporation of morphological and syntactic information into SMT, we focus on two aspects of reordering for the language pair German and English, namely question inversion and detachable German verb prefixes. The results of systematic experiments are reported and demonstrate the applicability of the approach to both translation directions on a German-English corpus.
Anthology ID:
2001.mtsummit-papers.45
Volume:
Proceedings of Machine Translation Summit VIII
Month:
September 18-22
Year:
2001
Address:
Santiago de Compostela, Spain
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2001.mtsummit-papers.45
DOI:
Bibkey:
Cite (ACL):
Sonja Niessen and Hermann Ney. 2001. Morpho-syntactic analysis for reordering in statistical machine translation. In Proceedings of Machine Translation Summit VIII, Santiago de Compostela, Spain.
Cite (Informal):
Morpho-syntactic analysis for reordering in statistical machine translation (Niessen & Ney, MTSummit 2001)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2001.mtsummit-papers.45.pdf