Abstract
This paper describes our system submission for SemEval-2023 Task 12 AfriSenti-SemEval: Sentiment Analysis for African Languages. We propose an XGBoost-based ensemble model trained on emoticon frequency-based features and the predictions of several statistical models such as SVMs, Logistic Regression, Random Forests, and BERT-based pre-trained language models such as AfriBERTa and AfroXLMR. We also report results from additional experiments not in the system. Our system achieves a mixed bag of results, achieving a best rank of 7th in three of the languages - Igbo, Twi, and Yoruba.- Anthology ID:
- 2023.semeval-1.48
- Volume:
- Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023)
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 357–364
- Language:
- URL:
- https://aclanthology.org/2023.semeval-1.48
- DOI:
- Cite (ACL):
- Monil Gokani, K V Aditya Srivatsa, and Radhika Mamidi. 2023. Witcherses at SemEval-2023 Task 12: Ensemble Learning for African Sentiment Analysis. In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 357–364, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Witcherses at SemEval-2023 Task 12: Ensemble Learning for African Sentiment Analysis (Gokani et al., SemEval 2023)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2023.semeval-1.48.pdf