The Samsung and University of Edinburgh’s submission to IWSLT17

Pawel Przybysz, Marcin Chochowski, Rico Sennrich, Barry Haddow, Alexandra Birch


Abstract
This paper describes the joint submission of Samsung Research and Development, Warsaw, Poland and the University of Edinburgh team to the IWSLT MT task for TED talks. We took part in two translation directions, en-de and de-en. We also participated in the en-de and de-en lectures SLT task. The models have been trained with an attentional encoder-decoder model using the BiDeep model in Nematus. We filtered the training data to reduce the problem of noisy data, and we use back-translated monolingual data for domain-adaptation. We demonstrate the effectiveness of the different techniques that we applied via ablation studies. Our submission system outperforms our baseline, and last year’s University of Edinburgh submission to IWSLT, by more than 5 BLEU.
Anthology ID:
2017.iwslt-1.3
Volume:
Proceedings of the 14th International Conference on Spoken Language Translation
Month:
December 14-15
Year:
2017
Address:
Tokyo, Japan
Editors:
Sakriani Sakti, Masao Utiyama
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
International Workshop on Spoken Language Translation
Note:
Pages:
23–28
Language:
URL:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2017.iwslt-1.3/
DOI:
Bibkey:
Cite (ACL):
Pawel Przybysz, Marcin Chochowski, Rico Sennrich, Barry Haddow, and Alexandra Birch. 2017. The Samsung and University of Edinburgh’s submission to IWSLT17. In Proceedings of the 14th International Conference on Spoken Language Translation, pages 23–28, Tokyo, Japan. International Workshop on Spoken Language Translation.
Cite (Informal):
The Samsung and University of Edinburgh’s submission to IWSLT17 (Przybysz et al., IWSLT 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2017.iwslt-1.3.pdf