The KIT translation systems for IWSLT 2012

Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel, Alexander Waibel


Abstract
In this paper, we present the KIT systems participating in the English-French TED translation tasks of the IWSLT 2012 machine translation evaluation. We also present several additional experiments on the English-German, English-Chinese and English-Arabic translation pairs. Our system is a phrase-based statistical machine translation system, extended with many additional models that have been shown to enhance translation quality. For instance, it uses part-of-speech (POS)-based reordering, translation and language model adaptation, a bilingual language model, a word-cluster language model, discriminative word lexica (DWL), and a continuous space language model. In addition, the system incorporates special preprocessing and postprocessing steps. In preprocessing, the noisy corpora are filtered by removing noisy sentence pairs, whereas in postprocessing, the agreement between a noun and its surrounding words in the French translation is corrected based on POS tags with morphological information. Our system deals with speech transcription input by removing case information and punctuation except periods from the text translation model.
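The handling of speech transcription input described above (dropping case and all punctuation except periods) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `to_asr_style`, the regular expression, and the decision to keep word-internal apostrophes are assumptions made for illustration only.

```python
import re

# Assumption: "punctuation" means any character that is not a word character,
# whitespace, a period, or an apostrophe. Keeping apostrophes (so "don't" is
# not split) is an illustrative choice, not something the abstract states.
_PUNCT_EXCEPT_PERIOD = re.compile(r"[^\w\s.']")


def to_asr_style(text: str) -> str:
    """Lowercase the text and remove punctuation other than periods.

    Hypothetical helper that mimics ASR-like text, e.g. to normalize the
    source side of training data before building a translation model.
    """
    lowered = text.lower()
    # Replace removed punctuation with a space so neighbouring words stay apart.
    stripped = _PUNCT_EXCEPT_PERIOD.sub(" ", lowered)
    # Collapse the whitespace runs left behind by the substitution.
    return " ".join(stripped.split())


if __name__ == "__main__":
    print(to_asr_style("Hello, World! This is KIT."))
```

Applying such a normalization to the source side of the parallel data would let the translation model see input that resembles the transcription it is given at test time, while the target side keeps its case and punctuation.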
Anthology ID:
2012.iwslt-evaluation.3
Volume:
Proceedings of the 9th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 6-7
Year:
2012
Address:
Hong Kong
Venue:
IWSLT
SIG:
SIGSLT
Pages:
38–45
URL:
https://aclanthology.org/2012.iwslt-evaluation.3
Cite (ACL):
Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel, and Alexander Waibel. 2012. The KIT translation systems for IWSLT 2012. In Proceedings of the 9th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 38–45, Hong Kong.
Cite (Informal):
The KIT translation systems for IWSLT 2012 (Mediani et al., IWSLT 2012)
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2012.iwslt-evaluation.3.pdf