The 2011 KIT English ASR system for the IWSLT evaluation

Sebastian Stüker, Kevin Kilgour, Christian Saam, Alex Waibel


Abstract
This paper describes our English Speech-to-Text (STT) system for the 2011 IWSLT ASR track. The system consists of 2 subsystems with different front-ends—one MVDR based, one MFCC based—which are combined using confusion network combination to provide a base for a second pass speaker adapted MVDR system. We demonstrate that this set-up produces competitive results on the IWSLT 2010 dev and test sets.
Anthology ID:
2011.iwslt-evaluation.12
Volume:
Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 8-9
Year:
2011
Address:
San Francisco, California
Editors:
Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
94–97
Language:
URL:
https://aclanthology.org/2011.iwslt-evaluation.12
DOI:
Bibkey:
Cite (ACL):
Sebastian Stüker, Kevin Kilgour, Christian Saam, and Alex Waibel. 2011. The 2011 KIT English ASR system for the IWSLT evaluation. In Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 94–97, San Francisco, California.
Cite (Informal):
The 2011 KIT English ASR system for the IWSLT evaluation (Stüker et al., IWSLT 2011)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-2024-clasp/2011.iwslt-evaluation.12.pdf