Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof

Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese, Uriel Pascal Elingui


Abstract
This article presents the data collected and ASR systems developped for 4 sub-saharan african languages (Swahili, Hausa, Amharic and Wolof). To illustrate our methodology, the focus is made on Wolof (a very under-resourced language) for which we designed the first ASR system ever built in this language. All data and scripts are available online on our github repository.
Anthology ID:
L16-1611
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3863–3867
Language:
URL:
https://aclanthology.org/L16-1611
DOI:
Bibkey:
Cite (ACL):
Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese, and Uriel Pascal Elingui. 2016. Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3863–3867, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof (Gauthier et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-1/L16-1611.pdf
Code
 besacier/ALFFA_PUBLIC