Wieneke Wesseling
2008
The IFADV Corpus: a Free Dialog Video Corpus
Rob van Son | Wieneke Wesseling | Eric Sanders | Henk van den Heuvel
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright restrictions. A freely available annotated corpus, gratis and libre, is presented of high-quality video recordings of face-to-face conversational speech. Annotations include orthography, POS tags, and automatically generated phoneme transcriptions and word boundaries. In addition, both simple conversational function and gaze direction have been labeled. Within the bounds of the law, everything has been done to remove copyright and use restrictions. Annotations have been processed into RDBMS tables that allow SQL queries and direct connections to statistical software. Based on our experiences, we would like to advocate the formulation of best practices for both legal handling and database storage of recordings and annotations.
2005
Early Preparation of Experimentally Elicited Minimal Responses
Wieneke Wesseling | R. J. J. H. van Son
Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue