Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task
Salwa Saad Alahmari, Eric Atwell, Hadeel Saadany, Mohammad Alsalka
Abstract
This paper presents a study on sentiment anal- ysis of Dialectal Arabic (DA), with a particu- lar focus on Saudi and Moroccan (Darija) di- alects within the hospitality domain. We in- troduce a novel dataset comprising 698 Saudi Arabian proverbs annotated with sentiment polarity labels—Positive, Negative, and Neu- tral—collected from five major Saudi dialect regions: Najdi, Hijazi, Shamali, Janoubi, and Sharqawi. In addition to this, we used customer reviews for fine-tuning the CAMeLBERT-DA- SA model, which achieved a 75% F1 score in sentiment classification. To further evaluate the robustness of Arabic-centric models, we assessed the performance of three open-source large language models—Allam, ACeGPT, and Jais—in a zero-shot setting using the Ahasis shared task test set. Our results highlight the effectiveness of domain-specific fine-tuning in improving sentiment analysis performance and demonstrate the potential of Arabic-centric LLMs in zero-shot scenarios. This work con- tributes new linguistic resources and empirical insights to support ongoing research in senti- ment analysis for Arabic dialect- Anthology ID:
- 2025.ranlp-ahasis.11
- Volume:
- Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects
- Month:
- September
- Year:
- 2025
- Address:
- Varna, Bulgaria
- Editors:
- Maram Alharbi, Salmane Chafik, Saad Ezzini, Ruslan Mitkov, Tharindu Ranasinghe, Hansi Hettiarachchi
- Venues:
- RANLP | WS
- SIG:
- Publisher:
- INCOMA Ltd., Shoumen, Bulgaria
- Note:
- Pages:
- 69–75
- Language:
- URL:
- https://preview.aclanthology.org/corrections-2026-01/2025.ranlp-ahasis.11/
- DOI:
- Cite (ACL):
- Salwa Saad Alahmari, Eric Atwell, Hadeel Saadany, and Mohammad Alsalka. 2025. Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task. In Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects, pages 69–75, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
- Cite (Informal):
- Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task (Alahmari et al., RANLP 2025)
- PDF:
- https://preview.aclanthology.org/corrections-2026-01/2025.ranlp-ahasis.11.pdf