The IOIT English ASR system for IWSLT 2016

Van Huy Nguyen, Trung-Nghia Phung, Tat Thang Vu, Chi Mai Luong


Abstract
This paper describes the speech recognition system of IOIT for IWSLT 2016. Four single DNN-based systems were developed to produce the 1st-pass lattices for the test sets using a baseline language model. The 2nd-pass lattices were further obtained by applying N-best list rescoring on topic adapted language models which were constructed from closed topic sentences by applying a text selection method. The final transcriptions of test sets were finally produced by combining the rescored results. On the 2013 evaluation set, we are able to reduce the word error rate of 1.62% absolute. On the 2014, provided as a development set, the word error rate of our transcription is 11.3%.
Anthology ID:
2016.iwslt-1.14
Volume:
Proceedings of the 13th International Conference on Spoken Language Translation
Month:
December 8-9
Year:
2016
Address:
Seattle, Washington D.C
Editors:
Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Rolando Cattoni, Marcello Federico
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
International Workshop on Spoken Language Translation
Note:
Pages:
Language:
URL:
https://aclanthology.org/2016.iwslt-1.14
DOI:
Bibkey:
Cite (ACL):
Van Huy Nguyen, Trung-Nghia Phung, Tat Thang Vu, and Chi Mai Luong. 2016. The IOIT English ASR system for IWSLT 2016. In Proceedings of the 13th International Conference on Spoken Language Translation, Seattle, Washington D.C. International Workshop on Spoken Language Translation.
Cite (Informal):
The IOIT English ASR system for IWSLT 2016 (Nguyen et al., IWSLT 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp22-frontmatter/2016.iwslt-1.14.pdf