Abstract
This year FBK took part in the BTEC translation task, with source languages Arabic and Turkish and target language English, and in the new TALK task, source English and target French. We worked in the framework of phrase-based statistical machine translation aiming to improve coverage of models in presence of rich morphology, on one side, and to make better use of available resources through data selection techniques. New morphological segmentation rules were developed for Turkish-English. The combination of several Turkish segmentation schemes into a lattice input led to an improvement wrt to last year. The use of additional training data was explored for Arabic-English, while on the English to French task improvement was achieved over a strong baseline by automatically selecting relevant and high quality data from the available training corpora.- Anthology ID:
- 2010.iwslt-evaluation.5
- Volume:
- Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign
- Month:
- December 2-3
- Year:
- 2010
- Address:
- Paris, France
- Venue:
- IWSLT
- SIG:
- SIGSLT
- Publisher:
- Note:
- Pages:
- 53–58
- Language:
- URL:
- https://aclanthology.org/2010.iwslt-evaluation.5
- DOI:
- Cite (ACL):
- Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, and Marcello Federico. 2010. FBK @ IWSLT 2010. In Proceedings of the 7th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 53–58, Paris, France.
- Cite (Informal):
- FBK @ IWSLT 2010 (Bisazza et al., IWSLT 2010)
- PDF:
- https://preview.aclanthology.org/starsem-semeval-split/2010.iwslt-evaluation.5.pdf