PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion
Meizhi Jin, Cheng Chen, Mengyuan Zhou, Mengfei Yuan, Xiaolong Hou, Xiyang Du, Lianxin Jiang, Jianyu Li
Abstract
This paper describes our system used in the SemEval-2023 Task12: Sentiment Analysis for Low-resource African Languages using Twit- ter Dataset (Muhammad et al., 2023c). The AfriSenti-SemEval Shared Task 12 is based on a collection of Twitter datasets in 14 African languages for sentiment classification. It con- sists of three sub-tasks. Task A is a monolin- gual sentiment classification which covered 12 African languages. Task B is a multilingual sen- timent classification which combined training data from Task A (12 African languages). Task C is a zero-shot sentiment classification. We uti- lized various strategies, including monolingual training, multilingual mixed training, and trans- lation technology, and proposed a weighted vot- ing method that combined the results of differ- ent strategies. Substantially, in the monolingual subtask, our system achieved Top-1 in two lan- guages (Yoruba and Twi) and Top-2 in four languages (Nigerian Pidgin, Algerian Arabic, and Swahili, Multilingual). In the multilingual subtask, Our system achived Top-2 in publish leaderBoard.- Anthology ID:
- 2023.semeval-1.93
- Volume:
- Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 679–685
- Language:
- URL:
- https://aclanthology.org/2023.semeval-1.93
- DOI:
- 10.18653/v1/2023.semeval-1.93
- Cite (ACL):
- Meizhi Jin, Cheng Chen, Mengyuan Zhou, Mengfei Yuan, Xiaolong Hou, Xiyang Du, Lianxin Jiang, and Jianyu Li. 2023. PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 679–685, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion (Jin et al., SemEval 2023)
- PDF:
- https://preview.aclanthology.org/corrections-2024-05/2023.semeval-1.93.pdf