Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof
Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese, Uriel Pascal Elingui
Abstract
This article presents the data collected and the ASR systems developed for four sub-Saharan African languages (Swahili, Hausa, Amharic and Wolof). To illustrate our methodology, we focus on Wolof, a very under-resourced language for which we designed the first ASR system ever built. All data and scripts are available online in our GitHub repository.
- Anthology ID:
- L16-1611
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 3863–3867
- URL:
- https://aclanthology.org/L16-1611
- Cite (ACL):
- Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese, and Uriel Pascal Elingui. 2016. Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3863–3867, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof (Gauthier et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-1/L16-1611.pdf
- Code:
- besacier/ALFFA_PUBLIC