Integration of POStag-based Source Reordering into SMT Decoding by an Extended Search Graph

Josep M. Crego, José B. Mariño


Abstract
This paper presents a reordering framework for statistical machine translation (SMT) in which source-side reorderings are integrated into SMT decoding, allowing for a highly constrained reordered search graph. The monotone search is extended by means of a set of reordering patterns (linguistically motivated rewrite patterns). Patterns are learnt automatically in training from word-to-word alignments and source-side part-of-speech (POS) tags. While traversing the extended search graph, the decoder evaluates every hypothesis using a group of widely used SMT models, aided by an additional N-gram language model of source-side POS tags. Experiments are reported on the Euparl task (Spanish-to-English and English-to-Spanish). Results are presented for translation accuracy (using human and automatic evaluations) and computational efficiency, showing significant improvements in translation quality for both translation directions at a very low computational cost.
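As a rough illustration of how POS-based rewrite patterns can extend a monotone search, the following sketch enumerates the source word orders that a small pattern table licenses. It is not the authors' implementation: the function name `reordered_paths`, the pattern table format (POS-tag sequence mapped to permutations of its positions), and the tag names are assumptions made for this example, and the model scoring performed by the decoder is left out entirely.

```python
from typing import Dict, List, Tuple

# Hypothetical pattern table: a POS-tag sequence maps to one or more
# permutations of its positions (e.g. (1, 0) swaps a two-word span).
Patterns = Dict[Tuple[str, ...], List[Tuple[int, ...]]]


def reordered_paths(words: List[str], tags: List[str],
                    patterns: Patterns) -> List[List[str]]:
    """Enumerate source word orders licensed by the patterns.

    The monotone order is always included; each matching pattern adds
    alternative paths, mimicking extra arcs in an extended search graph.
    """
    n = len(words)

    def expand(i: int) -> List[List[str]]:
        if i == n:
            return [[]]
        # Monotone step: keep word i in its original position.
        paths = [[words[i]] + rest for rest in expand(i + 1)]
        # Reordering steps: apply every pattern whose tags match at i.
        for tag_seq, perms in patterns.items():
            j = i + len(tag_seq)
            if j <= n and tuple(tags[i:j]) == tag_seq:
                for perm in perms:
                    head = [words[i + k] for k in perm]
                    paths += [head + rest for rest in expand(j)]
        return paths

    return expand(0)


# Illustrative Spanish input with made-up tag names.
example_patterns: Patterns = {("NOUN", "ADJ"): [(1, 0)]}
example_paths = reordered_paths(
    ["un", "programa", "ambicioso"],
    ["DET", "NOUN", "ADJ"],
    example_patterns,
)
for path in example_paths:
    print(" ".join(path))
# prints:
# un programa ambicioso
# un ambicioso programa
```

Because only spans that match a learnt pattern produce alternative paths, the number of hypotheses stays close to that of the monotone search, which is the sense in which the reordered search graph is highly constrained.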
Anthology ID:
2006.amta-papers.4
Volume:
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
August 8-12
Year:
2006
Address:
Cambridge, Massachusetts, USA
Venue:
AMTA
Publisher:
Association for Machine Translation in the Americas
Pages:
29–36
URL:
https://aclanthology.org/2006.amta-papers.4
Cite (ACL):
Josep M. Crego and José B. Mariño. 2006. Integration of POStag-based Source Reordering into SMT Decoding by an Extended Search Graph. In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 29–36, Cambridge, Massachusetts, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Integration of POStag-based Source Reordering into SMT Decoding by an Extended Search Graph (Crego & Mariño, AMTA 2006)
PDF:
https://preview.aclanthology.org/remove-xml-comments/2006.amta-papers.4.pdf