LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones

Amr Keleg


Abstract
Large Language Models (LLMs) have the potential of being a useful tool that can automate tasks, and assist humans. However, these models are more fluent in English and more aligned with Western cultures, norms, and values. Arabic-specific LLMs are being developed to better capture the nuances of the Arabic language, and the views of the Arabs. However, Arabs are sometimes assumed to share the same culture. In this position paper, we discuss the limitations of this assumption and provide our recommendations for how to curate better alignment data that models the cultural diversity within the Arab world.
Anthology ID:
2025.c3nlp-1.1
Volume:
Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Vinodkumar Prabhakaran, Sunipa Dev, Luciana Benotti, Daniel Hershcovich, Yong Cao, Li Zhou, Laura Cabello, Ife Adebara
Venues:
C3NLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–9
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.c3nlp-1.1/
DOI:
Bibkey:
Cite (ACL):
Amr Keleg. 2025. LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones. In Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025), pages 1–9, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones (Keleg, C3NLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.c3nlp-1.1.pdf