Speech Recognition Web Services for Dutch

Joris Pelemans, Kris Demuynck, Hugo Van hamme, Patrick Wambacq


Abstract
In this paper we present 3 applications in the domain of Automatic Speech Recognition for Dutch, all of which are developed using our in-house speech recognition toolkit SPRAAK. The speech-to-text transcriber is a large vocabulary continuous speech recognizer, optimized for Southern Dutch. It is capable to select components and adjust parameters on the fly, based on the observed conditions in the audio and was recently extended with the capability of adding new words to the lexicon. The grapheme-to-phoneme converter generates possible pronunciations for Dutch words, based on lexicon lookup and linguistic rules. The speech-text alignment system takes audio and text as input and constructs a time aligned output where every word receives exact begin and end times. All three of the applications (and others) are freely available, after registration, as a web application on http://www.spraak.org/webservice/ and in addition, can be accessed as a web service in automated tools.
Anthology ID:
L14-1200
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3041–3044
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/196_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Joris Pelemans, Kris Demuynck, Hugo Van hamme, and Patrick Wambacq. 2014. Speech Recognition Web Services for Dutch. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3041–3044, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Speech Recognition Web Services for Dutch (Pelemans et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/196_Paper.pdf