Abigail Oppong
2026
Afri-MCQA: Multimodal Cultural Question Answering for African Languages
Atnafu Lambebo Tonja | Srija Anand | Emilio Villa-Cueva | Israel Abebe Azime | Jesujoba Oluwadara Alabi | Muhidin A. Mohamed | Debela Desalegn Yadeta | Negasi Haile Abadi | Abigail Oppong | Nnaemeka Casmir Obiefuna | Idris Abdulmumin | Naome A Etori | Eric Peter Wairagala | Kanda Patrick Tshinu | Imanigirimbabazi Emmanuel | Gabofetswe Malema | Alham Fikri Aji | David Ifeoluwa Adelani | Thamar Solorio
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Atnafu Lambebo Tonja | Srija Anand | Emilio Villa-Cueva | Israel Abebe Azime | Jesujoba Oluwadara Alabi | Muhidin A. Mohamed | Debela Desalegn Yadeta | Negasi Haile Abadi | Abigail Oppong | Nnaemeka Casmir Obiefuna | Idris Abdulmumin | Naome A Etori | Eric Peter Wairagala | Kanda Patrick Tshinu | Imanigirimbabazi Emmanuel | Gabofetswe Malema | Alham Fikri Aji | David Ifeoluwa Adelani | Thamar Solorio
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Africa is home to over one-third of the world’s languages, yet remains severely underrepresented in multimodal AI research. We introduce Afri-MCQA, the first Multilingual Cultural Question-Answering benchmark containing 7.5k Q A pairs across 15 African languages from 12 countries. The benchmark offers parallel text and speech modalities and was entirely created by native speakers. We find that models show poor performance across evaluated cultures, with near-zero accuracy on open-ended VQA when queried through native language or speech. To test linguistic competence, we include control experiments meant to assess this specific aspect separate from cultural knowledge, and we observe significant performance gaps between native languages and English for both text and speech. These findings underscore the pressing need for speech-first approaches, culturally grounded pretraining, and cross-lingual cultural transfer. We release Afri-MCQA to support more inclusive multimodal AI development.
2025
Examining the Cultural Encoding of Gender Bias in LLMs for Low-Resourced African Languages
Abigail Oppong | Hellina Hailu Nigatu | Chinasa T. Okolo
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Abigail Oppong | Hellina Hailu Nigatu | Chinasa T. Okolo
Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP)
Large Language Models (LLMs) are deployed in several aspects of everyday life. While the technology could have several benefits, like many socio-technical systems, it also encodes several biases. Trained on large, crawled datasets from the web, these models perpetuate stereotypes and regurgitate representational bias that is rampant in their training data. Languages encode gender in varying ways; some languages are grammatically gendered, while others do not. Bias in the languages themselves may also vary based on cultural, social, and religious contexts. In this paper, we investigate gender bias in LLMs by selecting two languages, Twi and Amharic. Twi is a non-gendered African language spoken in Ghana, while Amharic is a gendered language spoken in Ethiopia. Using these two languages on the two ends of the continent and their opposing grammatical gender system, we evaluate LLMs in three tasks: Machine Translation, Image Generation, and Sentence Completion. Our results give insights into the gender bias encoded in LLMs using two low-resourced languages and broaden the conversation on how culture and social structures play a role in disparate system performances.
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
Shamsuddeen Hassan Muhammad | Idris Abdulmumin | Abinew Ali Ayele | David Ifeoluwa Adelani | Ibrahim Said Ahmad | Saminu Mohammad Aliyu | Paul Röttger | Abigail Oppong | Andiswa Bukula | Chiamaka Ijeoma Chukwuneke | Ebrahim Chekol Jibril | Elyas Abdi Ismail | Esubalew Alemneh | Hagos Tesfahun Gebremichael | Lukman Jibril Aliyu | Meriem Beloucif | Oumaima Hourrane | Rooweither Mabuya | Salomey Osei | Samuel Rutunda | Tadesse Destaw Belay | Tadesse Kebede Guge | Tesfa Tegegne Asfaw | Lilian Diana Awuor Wanzare | Nelson Odhiambo Onyango | Seid Muhie Yimam | Nedjma Ousidhoum
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Shamsuddeen Hassan Muhammad | Idris Abdulmumin | Abinew Ali Ayele | David Ifeoluwa Adelani | Ibrahim Said Ahmad | Saminu Mohammad Aliyu | Paul Röttger | Abigail Oppong | Andiswa Bukula | Chiamaka Ijeoma Chukwuneke | Ebrahim Chekol Jibril | Elyas Abdi Ismail | Esubalew Alemneh | Hagos Tesfahun Gebremichael | Lukman Jibril Aliyu | Meriem Beloucif | Oumaima Hourrane | Rooweither Mabuya | Salomey Osei | Samuel Rutunda | Tadesse Destaw Belay | Tadesse Kebede Guge | Tesfa Tegegne Asfaw | Lilian Diana Awuor Wanzare | Nelson Odhiambo Onyango | Seid Muhie Yimam | Nedjma Ousidhoum
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Hate speech and abusive language are global phenomena that need socio-cultural background knowledge to be understood, identified, and moderated. However, in many regions of the Global South, there have been several documented occurrences of (1) absence of moderation and (2) censorship due to the reliance on keyword spotting out of context. Further, high-profile individuals have frequently been at the center of the moderation process, while large and targeted hate speech campaigns against minorities have been overlooked.These limitations are mainly due to the lack of high-quality data in the local languages and the failure to include local communities in the collection, annotation, and moderation processes. To address this issue, we present AfriHate: a multilingual collection of hate speech and abusive language datasets in 15 African languages. Each instance in AfriHate is a tweet annotated by native speakers familiar with the regional culture. We report the challenges related to the construction of the datasets and present various classification baseline results with and without using LLMs. We find that model performance highly depends on the language and that multilingual models can help boost performance in low-resource settings.
2022
AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages
Bonaventure F. P. Dossou | Atnafu Lambebo Tonja | Oreen Yousuf | Salomey Osei | Abigail Oppong | Iyanuoluwa Shode | Oluwabusayo Olufunke Awoyomi | Chris Emezue
Proceedings of the Third Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)
Bonaventure F. P. Dossou | Atnafu Lambebo Tonja | Oreen Yousuf | Salomey Osei | Abigail Oppong | Iyanuoluwa Shode | Oluwabusayo Olufunke Awoyomi | Chris Emezue
Proceedings of the Third Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)
In recent years, multilingual pre-trained language models have gained prominence due to their remarkable performance on numerous downstream Natural Language Processing tasks (NLP). However, pre-training these large multilingual language models requires a lot of training data, which is not available for African Languages. Active learning is a semi-supervised learning algorithm, in which a model consistently and dynamically learns to identify the most beneficial samples to train itself on, in order to achieve better optimization and performance on downstream tasks. Furthermore, active learning effectively and practically addresses real-world data scarcity. Despite all its benefits, active learning, in the context of NLP and especially multilingual language models pretraining, has received little consideration. In this paper, we present AfroLM, a multilingual language model pretrained from scratch on 23 African languages (the largest effort to date) using our novel self-active learning framework. Pretrained on a dataset significantly (14x) smaller than existing baselines, AfroLM outperforms many multilingual pretrained language models (AfriBERTa, XLMR-base, mBERT) on various NLP downstream tasks (NER, text classification, and sentiment analysis). Additional out-of-domain sentiment analysis experiments show that AfroLM is able to generalize well across various domains. We release the code source, and our datasets used in our framework at https://github.com/bonaventuredossou/MLM_AL.
Search
Fix author
Co-authors
- Idris Abdulmumin 2
- David Ifeoluwa Adelani 2
- Salomey Osei 2
- Atnafu Lambebo Tonja 2
- Negasi Haile Abadi 1
- Ibrahim Said Ahmad 1
- Alham Fikri Aji 1
- Jesujoba Alabi 1
- Esubalew Alemneh 1
- Saminu Mohammad Aliyu 1
- Lukman Jibril Aliyu 1
- Srija Anand 1
- Tesfa Tegegne Asfaw 1
- Oluwabusayo Olufunke Awoyomi 1
- Abinew Ali Ayele 1
- Israel Abebe Azime 1
- Tadesse Destaw Belay 1
- Meriem Beloucif 1
- Andiswa Bukula 1
- Chiamaka Ijeoma Chukwuneke 1
- Bonaventure F. P. Dossou 1
- Chris Chinenye Emezue 1
- Imanigirimbabazi Emmanuel 1
- Naome A. Etori 1
- Hagos Tesfahun Gebremichael 1
- Tadesse Kebede Guge 1
- Oumaima Hourrane 1
- Elyas Abdi Ismail 1
- Ebrahim Chekol Jibril 1
- Rooweither Mabuya 1
- Gabofetswe Malema 1
- Muhidin A. Mohamed 1
- Shamsuddeen Hassan Muhammad 1
- Hellina Hailu Nigatu 1
- Nnaemeka Casmir Obiefuna 1
- Chinasa T. Okolo 1
- Nelson Odhiambo Onyango 1
- Nedjma Ousidhoum 1
- Samuel Rutunda 1
- Paul Röttger 1
- Iyanuoluwa Shode 1
- Thamar Solorio 1
- Kanda Patrick Tshinu 1
- Emilio Villa-Cueva 1
- Eric Peter Wairagala 1
- Lilian Diana Awuor Wanzare 1
- Debela Desalegn Yadeta 1
- Seid Muhie Yimam 1
- Oreen Yousuf 1