AI4D - African Language Dataset Challenge
Abstract
As language and speech technologies become more advanced, the lack of fundamental digital resources for African languages, such as data, spell checkers and PoS taggers, means that the digital divide between these languages and others keeps growing. This work details the organisation of the AI4D - African Language Dataset Challenge, an effort to incentivize the creation, curation and uncovering of African language datasets through a competitive challenge, particularly datasets that are annotated or prepared for use in a downstream NLP task.
- Anthology ID:
- 2020.winlp-1.18
- Volume:
- Proceedings of the Fourth Widening Natural Language Processing Workshop
- Month:
- July
- Year:
- 2020
- Address:
- Seattle, USA
- Editors:
- Rossana Cunha, Samira Shaikh, Erika Varis, Ryan Georgi, Alicia Tsai, Antonios Anastasopoulos, Khyathi Raghavi Chandu
- Venue:
- WiNLP
- Publisher:
- Association for Computational Linguistics
- Pages:
- 68–77
- URL:
- https://aclanthology.org/2020.winlp-1.18
- DOI:
- 10.18653/v1/2020.winlp-1.18
- Cite (ACL):
- Kathleen Siminyu and Sackey Freshia. 2020. AI4D - African Language Dataset Challenge. In Proceedings of the Fourth Widening Natural Language Processing Workshop, pages 68–77, Seattle, USA. Association for Computational Linguistics.
- Cite (Informal):
- AI4D - African Language Dataset Challenge (Siminyu & Freshia, WiNLP 2020)