Carlos A. Henríquez Q.
2011
Deriving translation units using small additional corpora
Carlos A. Henríquez Q. | José B. Mariño | Rafael E. Banchs
Proceedings of the 15th Annual Conference of the European Association for Machine Translation
2008
The TALP&I2R SMT systems for IWSLT 2008.
Maxim Khalilov | Maria R. Costa-jussà | Carlos A. Henríquez Q. | José A. R. Fonollosa | Adolfo Hernández H. | José B. Mariño | Rafael E. Banchs | Chen Boxing | Min Zhang | Aiti Aw | Haizhou Li
Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign
This paper describes the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the architecture of the 2008 systems and outlines the translation schemes we used, focusing mainly on the new techniques intended to improve speech-to-speech translation quality. The novelties we introduce are an improved reordering method, a linear combination of translation and reordering models, and a new technique for inserting punctuation marks in a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish, and pivot Chinese-(English)-Spanish translation tasks.
Co-authors
- José B. Mariño 2
- Rafael E. Banchs 2
- Maxim Khalilov 1
- Maria R. Costa-jussà 1
- José A. R. Fonollosa 1