Heba Sbahi
2025
Assessing Large Language Models on Islamic Legal Reasoning: Evidence from Inheritance Law Evaluation
Abdessalam Bouchekif
|
Samer Rashwani
|
Heba Sbahi
|
Shahd Gaben
|
Mutaz Al Khatib
|
Mohammed Ghaly
Proceedings of The Third Arabic Natural Language Processing Conference
This paper evaluates the knowledge and reasoning capabilities of Large Language Models in Islamic inheritance law, ʿilm al-mawārīth. We assess the performance of seven LLMs using a benchmark of 1,000 multiple-choice questions covering diverse inheritance scenarios, designed to test each model’s ability—from understanding the inheritance context to computing the distribution of shares prescribed by Islamic jurisprudence. The results show a wide performance gap among models. o3 and Gemini 2.5 achieved accuracies above 90%, while ALLaM, Fanar, LLaMA, and Mistral scored below 50%. These disparities reflect important differences in reasoning ability and domain adaptation.We conduct a detailed error analysis to identify recurring failure patterns across models, including misunderstandings of inheritance scenarios, incorrect application of legal rules, and insufficient domain knowledge. Our findings highlight the limitations of current models in handling structured legal reasoning and suggest directions for improving their performance in Islamic legal reasoning.
QIAS 2025: Overview of the Shared Task on Islamic Inheritance Reasoning and Knowledge Assessment
Abdessalam Bouchekif
|
Samer Rashwani
|
Emad Soliman Ali Mohamed
|
Mutaz Alkhatib
|
Heba Sbahi
|
Shahd Gaben
|
Wajdi Zaghouani
|
Aiman Erbad
|
Mohammed Ghaly
Proceedings of The Third Arabic Natural Language Processing Conference: Shared Tasks
Search
Fix author
Co-authors
- Abdessalam Bouchekif 2
- Shahd Gaben 2
- Mohammed Ghaly 2
- Samer Rashwani 2
- Mutaz Al Khatib 1
- show all...