Aleksandra Zögling Markuš

Also published as: Aleksandra Zögling

2010

pdf abs
Acquisition and Annotation of Slovenian Lombard Speech Database
Damjan Vlaj | Aleksandra Zögling Markuš | Marko Kos | Zdravko Kačič
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This paper presents the acquisition and annotation of Slovenian Lombard Speech Database, the recording of which started in the year 2008. The database was recorded at the University of Maribor, Slovenia. The goal of this paper is to describe the hardware platform used for the acquisition of speech material, recording scenarios and tools used for the annotation of Slovenian Lombard Speech Database. The database consists of recordings of 10 Slovenian native speakers. Five males and five females were recorded. Each speaker pronounced a set of eight corpuses in two recording sessions with at least one week pause between recordings. The structure of the corpus is similar to SpeechDat II database. Approximately 30 minutes of speech material per speaker and per session was recorded. The manual annotation of speech material is performed with the LombardSpeechLabel tool developed at the University of Maribor. The speech and annotation material was saved on 10 DVDs (one speaker on one DVD).

2006

pdf abs
SINOD - Slovenian non-native speech database
Andrej Žgank | Darinka Verdonik | Aleksandra Zögling Markuš | Zdravko Kačič
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

This paper presents the SINOD database, which is the first Slovenian non-native speech database. It will be used to improve the performance of large vocabulary continuous speech recogniser for non-native speakers. The main quality impact is expected for acoustic models and recognisers vocabulary. The SINOD database is designed as supplement to the Slovenian BNSI Broadcast News database. The same BN recommendations were used for both databases. Two interviews with non-native Slovenian speakers were incorporated in the set. Both non-native speakers were female, whereas the journalist was Slovenian native male speaker. The transcription approach applied in the production phase is presented. Different statistics and analyses of database are given in the paper.