Barcelona Media SMT system description for the IWSLT 2009

Marta R. Costa-jussà, Rafael E. Banchs


Abstract
This paper describes the Barcelona Media SMT system in the IWSLT 2009 evaluation campaign. The Barcelona Media system is an statistical phrase-based system enriched with source context information. Adding source context in an SMT system is interesting to enhance the translation in order to solve lexical and structural choice errors. The novel technique uses a similarity metric among each test sentence and each training sentence. First experimental results of this technique are reported in the Arabic and Chinese Basic Traveling Expression Corpus (BTEC) task. Although working in a single domain, there are ambiguities in SMT translation units and slight improvements in BLEU are shown in both tasks (Zh2En and Ar2En).
Anthology ID:
2009.iwslt-evaluation.3
Volume:
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 1-2
Year:
2009
Address:
Tokyo, Japan
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
24–28
Language:
URL:
https://aclanthology.org/2009.iwslt-evaluation.3
DOI:
Bibkey:
Cite (ACL):
Marta R. Costa-jussà and Rafael E. Banchs. 2009. Barcelona Media SMT system description for the IWSLT 2009. In Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 24–28, Tokyo, Japan.
Cite (Informal):
Barcelona Media SMT system description for the IWSLT 2009 (Costa-jussà & Banchs, IWSLT 2009)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2009.iwslt-evaluation.3.pdf
Presentation:
 2009.iwslt-evaluation.3.Presentation.pdf