The Arabic Bible as an Evaluation Tool: The Case Study of the Khalīlī Arabic Dialect

Jakub Zbrzeżny, Ehud Reiter, Wei Zhao


Abstract
The paper presents a fully documented case study of how high-quality data combined with evaluators’ expertise can be utilised for conducting basic NLP experiments in the realm of low-resource languages such as local varieties of Colloquial Arabic, and how the Arabic Bible, hitherto underutilised in NLP, can serve as an evaluation tool. Our experiments on one of the rural Palestinian Arabic dialects of al-Khalīl / Hebron illustrate two points. On the one hand, popular models are clearly limited in their ability to produce outputs of a high level of dialectal specificity (here: rural area surrounding a major urban centre). On the other hand, they are capable to generate accurate translations from such dialects into Modern Standard Arabic. Thus, the models appear better at understanding dialects than at producing dialects.
Anthology ID:
2026.retroeval-main.4
Volume:
Proceedings of the 1st Symposium on Natural Language Generation Evaluations
Month:
June
Year:
2026
Address:
Aberdeen, United Kingdom
Editors:
Saad Mahamood, David M. Howcroft, Kees van Deemter, Simone Balloccu, Adarsa Sivaprasad, Barkavi Sundararajan, Alberto Bugarín Diz, Jose María Alonso-Moral
Venue:
RetroEval
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
24–32
Language:
URL:
https://preview.aclanthology.org/ingest-retroeval/2026.retroeval-main.4/
DOI:
Bibkey:
Cite (ACL):
Jakub Zbrzeżny, Ehud Reiter, and Wei Zhao. 2026. The Arabic Bible as an Evaluation Tool: The Case Study of the Khalīlī Arabic Dialect. In Proceedings of the 1st Symposium on Natural Language Generation Evaluations, pages 24–32, Aberdeen, United Kingdom. Association for Computational Linguistics.
Cite (Informal):
The Arabic Bible as an Evaluation Tool: The Case Study of the Khalīlī Arabic Dialect (Zbrzeżny et al., RetroEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-retroeval/2026.retroeval-main.4.pdf