Flaviane Romani Fernandes


2006

pdf
A BLARK extension for temporal annotation mining
Dafydd Gibbon | Flaviane Romani Fernandes | Thorsten Trippel
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The Basic Language Resource Kit (BLARK) proposed by Krauwer is designed for the creation of initial textual resources. There are a number of toolkits for the development of spoken language resources and systems, but tools for second level resources, that is, resources which are the result of processing primary level speech resources such as speech recordings. Typically, processing of this kind in phonetics is done manually, with the aid of spreadsheets multi-purpose statistics software. We propose a Basic Language and Speech Kit (BLAST) as an extension to BLARK and suggest a strategy for integrating the kit into the Natural Language Toolkit (NLTK). The prototype kit is evaluated in an application to examining temporal properties of spoken Brazilian Portuguese.