New bilingual speech databases for audio diarization
David Tavarez, Eva Navas, Daniel Erro, Ibon Saratxaga, Inma Hernaez
Abstract
This paper describes the process of collecting and recording two new bilingual speech databases in Spanish and Basque. They are designed primarily for speaker diarization in two different application domains: broadcast news audio and recorded meetings. First, both databases have been manually segmented. Next, several diarization experiments have been carried out in order to evaluate them. Our baseline speaker diarization system has been applied to both databases with around 30% of DER for broadcast news audio and 40% of DER for recorded meetings. Also, the behavior of the system when different languages are used by the same speaker has been tested.- Anthology ID:
- L14-1620
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2666–2670
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/799_Paper.pdf
- DOI:
- Cite (ACL):
- David Tavarez, Eva Navas, Daniel Erro, Ibon Saratxaga, and Inma Hernaez. 2014. New bilingual speech databases for audio diarization. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2666–2670, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- New bilingual speech databases for audio diarization (Tavarez et al., LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/799_Paper.pdf