Addressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice

Marion Weller-Di Marco, Alexander Fraser, Sabine Schulte im Walde


Abstract
Many errors in phrase-based SMT can be attributed to problems on three linguistic levels: morphological complexity in the target language, structural differences and lexical choice. We explore combinations of linguistically motivated approaches to address these problems in English-to-German SMT and show that they are complementary to one another, but also that the popular verbal pre-ordering can cause problems on the morphological and lexical level. A discriminative classifier can overcome these problems, in particular when enriching standard lexical features with features geared towards verbal inflection.
Anthology ID:
E17-2099
Volume:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Mirella Lapata, Phil Blunsom, Alexander Koller
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
625–630
Language:
URL:
https://aclanthology.org/E17-2099
DOI:
Bibkey:
Cite (ACL):
Marion Weller-Di Marco, Alexander Fraser, and Sabine Schulte im Walde. 2017. Addressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 625–630, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Addressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice (Weller-Di Marco et al., EACL 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-3/E17-2099.pdf