LILA: Cellular Telephone Speech Databases from Asia
Eric Sanders, Asuncion Moreno, Herbert Tropf, Lynette Melnar, Nurit Dekel, Breanna Gillies, Niklas Paulsson
Abstract
The goal of the LILA project was the collection of speech databases over cellular telephone networks of five languages in three Asian countries. Three languages were recorded in India: Hindi by first language speakers, Hindi by second language speakers and Indian English. Furthermore, Mandarin was recorded in China and Korean in South-Korea. The databases are part of the SpeechDat-family and follow the SpeechDat rules in many respects. All databases have been finished and have passed the validation tests. Both Hindi databases and the Korean database will be available to the public for sale.- Anthology ID:
- L08-1498
- Volume:
- Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
- Month:
- May
- Year:
- 2008
- Address:
- Marrakech, Morocco
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/278_paper.pdf
- DOI:
- Cite (ACL):
- Eric Sanders, Asuncion Moreno, Herbert Tropf, Lynette Melnar, Nurit Dekel, Breanna Gillies, and Niklas Paulsson. 2008. LILA: Cellular Telephone Speech Databases from Asia. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
- Cite (Informal):
- LILA: Cellular Telephone Speech Databases from Asia (Sanders et al., LREC 2008)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/278_paper.pdf