Abstract
In recent years, we witnessed great progress in different tasks of natural language understanding using machine learning. Question answering is one of these tasks which is used by search engines and social media platforms for improved user experience. Arabic is the language of the Holy Qur’an; the sacred text for 1.8 billion people across the world. Arabic is a challenging language for Natural Language Processing (NLP) due to its complex structures. In this article, we describe our attempts at OSACT5 Qur’an QA 2022 Shared Task, which is a question answering challenge on the Holy Qur’an in Arabic. We propose an ensemble learning model based on Arabic variants of BERT models. In addition, we perform post-processing to enhance the model predictions. Our system achieves a Partial Reciprocal Rank (pRR) score of 56.6% on the official test set.- Anthology ID:
- 2022.osact-1.19
- Volume:
- Proceedinsg of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur'an QA and Fine-Grained Hate Speech Detection
- Month:
- June
- Year:
- 2022
- Address:
- Marseille, France
- Editors:
- Hend Al-Khalifa, Tamer Elsayed, Hamdy Mubarak, Abdulmohsen Al-Thubaity, Walid Magdy, Kareem Darwish
- Venue:
- OSACT
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 154–161
- Language:
- URL:
- https://aclanthology.org/2022.osact-1.19
- DOI:
- Cite (ACL):
- Mohamemd Elkomy and Amany M. Sarhan. 2022. TCE at Qur’an QA 2022: Arabic Language Question Answering Over Holy Qur’an Using a Post-Processed Ensemble of BERT-based Models. In Proceedinsg of the 5th Workshop on Open-Source Arabic Corpora and Processing Tools with Shared Tasks on Qur'an QA and Fine-Grained Hate Speech Detection, pages 154–161, Marseille, France. European Language Resources Association.
- Cite (Informal):
- TCE at Qur’an QA 2022: Arabic Language Question Answering Over Holy Qur’an Using a Post-Processed Ensemble of BERT-based Models (Elkomy & Sarhan, OSACT 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2022.osact-1.19.pdf
- Data
- SQuAD