Feature Engineering is not Dead: A Step Towards State of the Art for Arabic Automated Essay Scoring
Marwan Sayed, Sohaila Eltanbouly, May Bashendy, Tamer Elsayed
Abstract
Automated Essay Scoring (AES) has shown significant advancements in educational assessment. However, under-resourced languages like Arabic have received limited attention. To bridge this gap and enable robust Arabic AES, this paper introduces the first publicly available comprehensive set of engineered features tailored for Arabic AES, covering surface-level, readability, lexical, syntactic, and semantic features. Experiments are conducted on a dataset of 620 Arabic essays, each annotated with both holistic and trait-specific scores. Our findings demonstrate that the proposed feature set is effective across different models and competitive with recent NLP advances including LLMs, establishing state-of-the-art performance and providing strong baselines for future Arabic AES research. Moreover, the resulting feature set offers a reusable and foundational resource, contributing towards the development of more effective Arabic AES systems.
- Anthology ID: 2025.arabicnlp-main.19
- Volume: Proceedings of The Third Arabic Natural Language Processing Conference
- Month: November
- Year: 2025
- Address: Suzhou, China
- Editors: Kareem Darwish, Ahmed Ali, Ibrahim Abu Farha, Samia Touileb, Imed Zitouni, Ahmed Abdelali, Sharefah Al-Ghamdi, Sakhar Alkhereyf, Wajdi Zaghouani, Salam Khalifa, Badr AlKhamissi, Rawan Almatham, Injy Hamed, Zaid Alyafeai, Areeb Alowisheq, Go Inoue, Khalil Mrini, Waad Alshammari
- Venue: ArabicNLP
- Publisher: Association for Computational Linguistics
- Pages: 231–245
- URL: https://preview.aclanthology.org/ingest-emnlp/2025.arabicnlp-main.19/
- Cite (ACL): Marwan Sayed, Sohaila Eltanbouly, May Bashendy, and Tamer Elsayed. 2025. Feature Engineering is not Dead: A Step Towards State of the Art for Arabic Automated Essay Scoring. In Proceedings of The Third Arabic Natural Language Processing Conference, pages 231–245, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal): Feature Engineering is not Dead: A Step Towards State of the Art for Arabic Automated Essay Scoring (Sayed et al., ArabicNLP 2025)
- PDF: https://preview.aclanthology.org/ingest-emnlp/2025.arabicnlp-main.19.pdf