Interoperability of audio corpora : the case of the French corpora

Olivier Baude, Michel Jacobson, Atanas Tchobanov, Richard Walter


Abstract
We present the choices made within three oral corpus projects: Socio-linguistics studies on Orleans (ESLO), Phonology of Contemporary French (PFC), and the Archivage corpus of the LACITO lab. This comparative presentation of three corpora of audio linguistic resources stems from an analysis of the options the projects had to choose among in order to describe the corpora for discovery purposes and to compare their contents. The aim is to illustrate the value of thinking through interoperability and the methodology of coding and metadata. Through this approach, we want to simplify the technical creation of audio corpora and thus the constitution of linguistic resources usable by wider academic and industrial communities.
Anthology ID:
L06-1254
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/430_pdf.pdf
Cite (ACL):
Olivier Baude, Michel Jacobson, Atanas Tchobanov, and Richard Walter. 2006. Interoperability of audio corpora : the case of the French corpora. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
Interoperability of audio corpora : the case of the French corpora (Baude et al., LREC 2006)