The DCU machine translation systems for IWSLT 2011

Pratyush Banerjee, Hala Almaghout, Sudip Naskar, Johann Roturier, Jie Jiang, Andy Way, Josef van Genabith


Abstract
In this paper, we provide a description of the Dublin City University’s (DCU) submissions in the IWSLT 2011 evaluationcampaign.1 WeparticipatedintheArabic-Englishand Chinese-English Machine Translation(MT) track translation tasks. We use phrase-based statistical machine translation (PBSMT) models to create the baseline system. Due to the open-domain nature of the data to be translated, we use domain adaptation techniques to improve the quality of translation. Furthermore, we explore target-side syntactic augmentation for an Hierarchical Phrase-Based (HPB) SMT model. Combinatory Categorial Grammar (CCG) is used to extract labels for target-side phrases and non-terminals in the HPB system. Combining the domain adapted language models with the CCG-augmented HPB system gave us the best translations for both language pairs providing statistically significant improvements of 6.09 absolute BLEU points (25.94% relative) and 1.69 absolute BLEU points (15.89% relative) over the unadapted PBSMT baselines for the Arabic-English and Chinese-English language pairs, respectively.
Anthology ID:
2011.iwslt-evaluation.4
Volume:
Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 8-9
Year:
2011
Address:
San Francisco, California
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
41–48
Language:
URL:
https://aclanthology.org/2011.iwslt-evaluation.4
DOI:
Bibkey:
Cite (ACL):
Pratyush Banerjee, Hala Almaghout, Sudip Naskar, Johann Roturier, Jie Jiang, Andy Way, and Josef van Genabith. 2011. The DCU machine translation systems for IWSLT 2011. In Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 41–48, San Francisco, California.
Cite (Informal):
The DCU machine translation systems for IWSLT 2011 (Banerjee et al., IWSLT 2011)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2011.iwslt-evaluation.4.pdf