The acquisition and dialog act labeling of the EDECAN-SPORTS corpus
Lluís-F. Hurtado, Fernando García, Emilio Sanchis, Encarna Segarra
Abstract
In this paper, we present the acquisition and labeling processes of the EDECAN-SPORTS corpus, which is a corpus that is oriented to the development of multimodal dialog systems acquired in Spanish and Catalan. Two Wizards of Oz were used in order to better simulate the behavior of an actual system in terms of both the information used by the different modules and the communication mechanisms between these modules. User and system dialog-act labeling, as well as other information, have been obtained automatically using this acquisition method Some preliminary experimental results with the acquired corpus show the appropriateness of the proposed acquisition method for the development of dialog systems- Anthology ID:
- L12-1156
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 1416–1420
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/331_Paper.pdf
- DOI:
- Cite (ACL):
- Lluís-F. Hurtado, Fernando García, Emilio Sanchis, and Encarna Segarra. 2012. The acquisition and dialog act labeling of the EDECAN-SPORTS corpus. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1416–1420, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- The acquisition and dialog act labeling of the EDECAN-SPORTS corpus (Hurtado et al., LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/331_Paper.pdf