Abstract
This paper reports on the participation of FBK (formerly ITC-irst) in the IWSLT 2007 Evaluation. FBK participated in three tasks: Chinese-to-English, Japanese-to-English, and Italian-to-English. Compared with last year, the translation systems were developed with the Moses Toolkit and the IRSTLM library, both available as open-source software. Moreover, several novel ideas were investigated: the use of confusion networks as input to manage ambiguity in punctuation, the estimation of an additional language model from Google's Web 1T 5-gram collection, the combination of true-case and lower-case language models, and finally the use of multiple phrase-tables. Experiments on top of a state-of-the-art baseline showed that these methods yielded significant BLEU score improvements.
- Anthology ID:
- 2007.iwslt-1.11
- Volume:
- Proceedings of the Fourth International Workshop on Spoken Language Translation
- Month:
- October 15-16
- Year:
- 2007
- Address:
- Trento, Italy
- Venue:
- IWSLT
- SIG:
- SIGSLT
- URL:
- https://aclanthology.org/2007.iwslt-1.11
- Cite (ACL):
- Nicola Bertoldi, Mauro Cettolo, Roldano Cattoni, and Marcello Federico. 2007. FBK@IWSLT 2007. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
- Cite (Informal):
- FBK@IWSLT 2007 (Bertoldi et al., IWSLT 2007)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/2007.iwslt-1.11.pdf