Abstract
In this paper we introduce a Bulgarian speech database, which was created for the purpose of ASR technology development. The paper describes the design and the content of the speech database. We present also an empirical evaluation of the performance of a LVCSR system for Bulgarian trained on the BulPhonC data. The resource is available free for scientific usage.- Anthology ID:
- L16-1123
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 771–774
- Language:
- URL:
- https://aclanthology.org/L16-1123
- DOI:
- Cite (ACL):
- Neli Hateva, Petar Mitankin, and Stoyan Mihov. 2016. BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 771–774, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- BulPhonC: Bulgarian Speech Corpus for the Development of ASR Technology (Hateva et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/L16-1123.pdf