Kārlis Goba
2014
Designing the Latvian Speech Recognition Corpus
Mārcis Pinnis | Ilze Auziņa | Kārlis Goba
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper presents the first Latvian speech corpus designed specifically for speech recognition. It outlines the decisions made during corpus design, based on an analysis of related work on speech corpus creation for other languages. The authors also provide the guidelines used to create the corpus; these guidelines are general enough to be reused by other researchers building speech recognition corpora for other languages. The corpus consists of two parts: an orthographically annotated corpus containing 100 hours of orthographically transcribed audio data, and a phonetically annotated corpus containing 4 hours of phonetically transcribed audio data. Metadata files in XML format provide additional details about the speakers, noise levels, speech styles, etc. The corpus is phonetically balanced and phonetically rich, and the paper also describes the methodology used to assess its phonetic balance.
2007
Development of Text-To-Speech system for Latvian
Kārlis Goba | Andrejs Vasiļjevs
Proceedings of the 16th Nordic Conference of Computational Linguistics (NODALIDA 2007)