Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian

Mārcis Pinnis, Askars Salimbajevs, Ilze Auziņa


Abstract
In this paper the authors present a speech corpus designed and created for the development and evaluation of dictation systems in Latvian. The corpus consists of over nine hours of orthographically annotated speech from 30 different speakers. The corpus features spoken commands that are common for dictation systems for text editors. The corpus is evaluated in an automatic speech recognition scenario. Evaluation results in an ASR dictation scenario show that the addition of the corpus to the acoustic model training data in combination with language model adaptation allows to decrease the WER by up to relative 41.36% (or 16.83% in absolute numbers) compared to a baseline system without language model adaptation. Contribution of acoustic data augmentation is at relative 12.57% (or 3.43% absolute).
Anthology ID:
L16-1124
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
775–780
Language:
URL:
https://aclanthology.org/L16-1124
DOI:
Bibkey:
Cite (ACL):
Mārcis Pinnis, Askars Salimbajevs, and Ilze Auziņa. 2016. Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 775–780, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Designing a Speech Corpus for the Development and Evaluation of Dictation Systems in Latvian (Pinnis et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/L16-1124.pdf