The TALP&I2R SMT systems for IWSLT 2008.
Maxim Khalilov, Maria R. Costa-jussà, Carlos A. Henríquez Q., José A. R. Fonollosa, Adolfo Hernández H., José B. Mariño, Rafael E. Banchs, Chen Boxing, Min Zhang, Aiti Aw, Haizhou Li
Abstract
This paper describes the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the architecture of the 2008 systems and outlines the translation schemes we used, focusing mainly on the new techniques intended to improve speech-to-speech translation quality. The novelties we have introduced are an improved reordering method, a linear combination of translation and reordering models, and a new technique for inserting punctuation marks in a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish and pivot Chinese-(English)-Spanish translation tasks.
- Anthology ID:
- 2008.iwslt-evaluation.17
- Volume:
- Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign
- Month:
- October 20-21
- Year:
- 2008
- Address:
- Waikiki, Hawaii
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Pages:
- 116–123
- URL:
- https://aclanthology.org/2008.iwslt-evaluation.17
- Cite (ACL):
- Maxim Khalilov, Maria R. Costa-jussà, Carlos A. Henríquez Q., José A. R. Fonollosa, Adolfo Hernández H., José B. Mariño, Rafael E. Banchs, Chen Boxing, Min Zhang, Aiti Aw, and Haizhou Li. 2008. The TALP&I2R SMT systems for IWSLT 2008. In Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 116–123, Waikiki, Hawaii.
- Cite (Informal):
- The TALP&I2R SMT systems for IWSLT 2008. (Khalilov et al., IWSLT 2008)
- PDF:
- https://preview.aclanthology.org/add_acl24_videos/2008.iwslt-evaluation.17.pdf