BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology

Neli Hateva, Petar Mitankin, Stoyan Mihov


Abstract
In this paper we introduce a Bulgarian speech database, which was created for the purpose of ASR technology development. The paper describes the design and the content of the speech database. We present also an empirical evaluation of the performance of a LVCSR system for Bulgarian trained on the BulPhonC data. The resource is available free for scientific usage.
Anthology ID:
L16-1123
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
771–774
Language:
URL:
https://aclanthology.org/L16-1123
DOI:
Bibkey:
Cite (ACL):
Neli Hateva, Petar Mitankin, and Stoyan Mihov. 2016. BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 771–774, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology (Hateva et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/L16-1123.pdf