2016
pdf
abs
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
Eduardo Coutinho
|
Florian Hönig
|
Yue Zhang
|
Simone Hantke
|
Anton Batliner
|
Elmar Nöth
|
Björn Schuller
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
In this paper, we describe a new database with audio recordings of non-native (L2) speakers of English, and the perceptual evaluation experiment conducted with native English speakers for assessing the prosody of each recording. These annotations are then used to compute the gold standard using different methods, and a series of regression experiments is conducted to evaluate their impact on the performance of a regression model predicting the degree of naturalness of L2 speech. Further, we compare the relevance of different feature groups modelling prosody in general (without speech tempo), speech rate and pauses modelling speech tempo (fluency), voice quality, and a variety of spectral features. We also discuss the impact of various fusion strategies on performance. Overall, our results demonstrate that the prosody of non-native speakers of English as L2 can be reliably assessed using supra-segmental audio features; prosodic features seem to be the most important ones.
2004
pdf
abs
“You Stupid Tin Box” - Children Interacting with the AIBO Robot: A Cross-linguistic Emotional Speech Corpus
A. Batliner
|
C. Hacker
|
S. Steidl
|
E. Nöth
|
S. D’Arcy
|
M. Russell
|
M. Wong
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
This paper deals with databases that combine different aspects: children's speech, emotional speech, human-robot communication, cross-linguistics, and read vs. spontaneous speech: in a Wizard-of-Oz scenario, German and English children had to instruct Sony's AIBO robot to fulfil specific tasks. In one experimental condition, strictly parallel for German and English, the AIBO behaved `disobedient' by following it's own script irrespective of the child's commands. By that, reactions of different children to the same sequence of AIBO's actions could be obtained. In addition, both the German and the English children were recorded reading texts. The data are transliterated orthographically; emotional user states and some other phenomena will be annotated. We report preliminary word recognition rates and classification results.
1996
pdf
Integrating Syntactic and Prosodic Information for the Efficient Detection of Empty Categories
Anton Batliner
|
Anke Feldhaus
|
Stefan Geifiler
|
Andreas Kieflling
|
Tibor Kiss
|
Ralf Kompe
|
Elmar Noth
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics