Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese
Antonio Moreno-Sandoval, Leonardo Campillos Llanos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui, Marta Garrote-Salazar
Abstract
This paper presents a method for designing, compiling and annotating corpora intended for language learners. In particular, we focus on spoken corpora to be used as complementary material in the classroom as well as in examinations. We describe the three corpora (Spanish, Chinese and Japanese) compiled by the Laboratorio de Lingüística Informática at the Autonomous University of Madrid (LLI-UAM). A web-based concordance tool is used to search for examples in the corpus, and it provides the text along with the corresponding audio. Teaching materials from the corpus, consisting of the texts, the audio files and exercises on them, are currently under development.
- Anthology ID:
- L12-1404
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 2695–2701
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/697_Paper.pdf
- Cite (ACL):
- Antonio Moreno-Sandoval, Leonardo Campillos Llanos, Yang Dong, Emi Takamori, José M. Guirao, Paula Gozalo, Chieko Kimura, Kengo Matsui, and Marta Garrote-Salazar. 2012. Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2695–2701, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Spontaneous Speech Corpora for language learners of Spanish, Chinese and Japanese (Moreno-Sandoval et al., LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/697_Paper.pdf