Saheed Abdullahi Salahudeen - ACL Anthology

This is an internal preview of the ACL Anthology that may be incomplete and contain mistakes. Do not treat this content as an official publication.

Saheed Abdullahi Salahudeen

2023

pdf
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani | Marek Masiak | Israel Abebe Azime | Jesujoba Alabi | Atnafu Lambebo Tonja | Christine Mwase | Odunayo Ogundepo | Bonaventure F. P. Dossou | Akintunde Oladipo | Doreen Nixdorf | Chris Chinenye Emezue | Sana Al-azzawi | Blessing Sibanda | Davis David | Lolwethu Ndolela | Jonathan Mukiibi | Tunde Ajayi | Tatiana Moteu | Brian Odhiambo | Abraham Owodunni | Nnaemeka Obiefuna | Muhidin Mohamed | Shamsuddeen Hassan Muhammad | Teshome Mulugeta Ababu | Saheed Abdullahi Salahudeen | Mesay Gemeda Yigezu | Tajuddeen Gwadabe | Idris Abdulmumin | Mahlet Taye | Oluwabusayo Awoyomi | Iyanuoluwa Shode | Tolulope Adelani | Habiba Abdulganiyu | Abdul-Hakeem Omotayo | Adetola Adeeko | Abeeb Afolabi | Anuoluwapo Aremu | Olanrewaju Samuel | Clemencia Siro | Wangari Kimotho | Onyekachi Ogbu | Chinedu Mbonu | Chiamaka Chukwuneke | Samuel Fanijo | Jessica Ojo | Oyinkansola Awosan | Tadesse Kebede | Toadoum Sari Sakayo | Pamela Nyatsine | Freedmore Sidume | Oreen Yousuf | Mardiyyah Oduwole | Kanda Tshinu | Ussen Kimanuka | Thina Diko | Siyanda Nxakama | Sinodos Nigusse | Abdulmejid Johar | Shafie Mohamed | Fuad Mire Hassan | Moges Ahmed Mehamed | Evrard Ngabire | Jules Jules | Ivan Ssenkungu | Pontus Stenetorp
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)

pdf abs
HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis
Saheed Abdullahi Salahudeen | Falalu Ibrahim Lawan | Ahmad Wali | Amina Abubakar Imam | Aliyu Rabiu Shuaibu | Aliyu Yusuf | Nur Bala Rabiu | Musa Bello | Shamsuddeen Umaru Adamu | Saminu Mohammad Aliyu
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)

We present the findings of SemEval-2023 Task 12, a shared task on sentiment analysis for low-resource African languages using Twitter dataset. The task featured three subtasks; subtask A is monolingual sentiment classification with 12 tracks which are all monolingual languages, subtask B is multilingual sentiment classification using the tracks in subtask A and subtask C is a zero-shot sentiment classification. We present the results and findings of subtask A, subtask B and subtask C. We also release the code on github. Our goal is to leverage low-resource tweet data using pre-trained Afro-xlmr-large, AfriBERTa-Large, Bert-base-arabic-camelbert-da-sentiment (Arabic-camelbert), Multilingual-BERT (mBERT) and BERT models for sentiment analysis of 14 African languages. The datasets for these subtasks consists of a gold standard multi-class labeled Twitter datasets from these languages. Our results demonstrate that Afro-xlmr-large model performed better compared to the other models in most of the languages datasets. Similarly, Nigerian languages: Hausa, Igbo, and Yoruba achieved better performance compared to other languages and this can be attributed to the higher volume of data present in the languages.

pdf abs
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-information for Multi-level Sexism Classification
Saminu Mohammad Aliyu | Idris Abdulmumin | Shamsuddeen Hassan Muhammad | Ibrahim Said Ahmad | Saheed Abdullahi Salahudeen | Aliyu Yusuf | Falalu Ibrahim Lawan
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)

We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain - Reddit) for multilevel classification into Sexist or not Sexist, and other subsequent sub-classifications of the sexist data. We also use synthetic classification of unlabelled dataset and intermediary class information to maximize the performance of our models. We submitted a system in Task A, and it ranked 49th with F1-score of 0.82. This result showed to be competitive as it only under-performed the best system by 0.052%F1-score.

Co-authors

David Ifeoluwa Adelani 1

Israel Abebe Azime 1

Jesujoba Alabi 1

Atnafu Lambebo Tonja 1

Christine Mwase 1

Odunayo Ogundepo 1

Bonaventure F. P. Dossou 1

Akintunde Oladipo 1

Doreen Nixdorf 1

Chris Chinenye Emezue 1

Sana Al-Azzawi 1

Blessing Sibanda 1

Lolwethu Ndolela 1

Jonathan Mukiibi 1

Tatiana Moteu 1

Brian Odhiambo 1

Abraham Owodunni 1

Nnaemeka Obiefuna 1

Muhidin Mohamed 1

Teshome Mulugeta Ababu 1

Mesay Gemeda Yigezu 1

Tajuddeen Gwadabe 1

Oluwabusayo Awoyomi 1

Iyanuoluwa Shode 1

Tolulope Adelani 1

Habiba Abdulganiyu 1

Abdul-Hakeem Omotayo 1

Adetola Adeeko 1

Abeeb Afolabi 1

Anuoluwapo Aremu 1

Olanrewaju Samuel 1

Clemencia Siro 1

Wangari Kimotho 1

Onyekachi Ogbu 1

Chinedu Mbonu 1

Chiamaka Chukwuneke 1

Samuel Fanijo 1

Oyinkansola Awosan 1

Tadesse Kebede 1

Toadoum Sari Sakayo 1

Pamela Nyatsine 1

Freedmore Sidume 1

Mardiyyah Oduwole 1

Ussen Kimanuka 1

Siyanda Nxakama 1

Sinodos Nigusse 1

Abdulmejid Johar 1

Shafie Mohamed 1

Fuad Mire Hassan 1

Moges Ahmed Mehamed 1

Evrard Ngabire 1

Ivan Ssenkungu 1

Pontus Stenetorp 1

Amina Abubakar Imam 1

Aliyu Rabiu Shuaibu 1

Nur Bala Rabiu 1

Shamsuddeen Umaru Adamu 1

Ibrahim Sa’id Ahmad 1

Venues