FBK@IWSLT 2007

Nicola Bertoldi, Mauro Cettolo, Roldano Cattoni, Marcello Federico


Abstract
This paper reports on the participation of FBK (formerly ITC-irst) in the IWSLT 2007 Evaluation. FBK participated in three tasks: Chinese-to-English, Japanese-to-English, and Italian-to-English. Compared with last year, the translation systems were developed with the Moses Toolkit and the IRSTLM library, both available as open-source software. Moreover, several novel ideas were investigated: the use of confusion networks as input to manage ambiguity in punctuation, the estimation of an additional language model from Google’s Web 1T 5-gram collection, the combination of true-case and lower-case language models, and the use of multiple phrase tables. Experiments on top of a state-of-the-art baseline showed that these methods accounted for significant BLEU score improvements.
Anthology ID:
2007.iwslt-1.11
Volume:
Proceedings of the Fourth International Workshop on Spoken Language Translation
Month:
October 15-16
Year:
2007
Address:
Trento, Italy
Venue:
IWSLT
SIG:
SIGSLT
URL:
https://aclanthology.org/2007.iwslt-1.11
Cite (ACL):
Nicola Bertoldi, Mauro Cettolo, Roldano Cattoni, and Marcello Federico. 2007. FBK@IWSLT 2007. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
Cite (Informal):
FBK@IWSLT 2007 (Bertoldi et al., IWSLT 2007)
PDF:
https://preview.aclanthology.org/ingestion-script-update/2007.iwslt-1.11.pdf