Coupling Statistical Machine Translation with Rule-based Transfer and Generation

Arafat Ahsan, Prasanth Kolachina, Sudheer Kolachina, Dipti Misra, Rajeev Sangal


Abstract
In this paper, we present the insights gained from a detailed study of coupling a highly modular English-Hindi RBMT system with a standard phrase-based SMT system. Coupling the RBMT and SMT systems at various stages in the RBMT pipeline, we observe the effects of the source transformations at each stage on the performance of the coupled MT system. We propose an architecture that systematically exploits the structural transfer and robust generation capabilities of the RBMT system. Working with the English-Hindi language pair, we show that the coupling configurations explored in our experiments help address different aspects of the typological divergence between these languages. In spite of working with very small datasets, we report significant improvements both in terms of BLEU (7.14 and 0.87 over the RBMT and the SMT baselines respectively) and subjective evaluation (relative decrease of 17% in SSER).
Anthology ID:
2010.amta-papers.6
Volume:
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers
Month:
October 31-November 4
Year:
2010
Address:
Denver, Colorado, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
Language:
URL:
https://aclanthology.org/2010.amta-papers.6
DOI:
Bibkey:
Cite (ACL):
Arafat Ahsan, Prasanth Kolachina, Sudheer Kolachina, Dipti Misra, and Rajeev Sangal. 2010. Coupling Statistical Machine Translation with Rule-based Transfer and Generation. In Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, Denver, Colorado, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Coupling Statistical Machine Translation with Rule-based Transfer and Generation (Ahsan et al., AMTA 2010)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2010.amta-papers.6.pdf