Designing Annotation Guidelines for Trait-Based Arabic Automated Essay Scoring: A Systematic Methodology

Walid Massoud, Houda Bouamor, Abdelrahman Abdel Latif Hussein, Abdullah Mohamed Mohamed Zekri


Abstract
Automated Essay Scoring (AES) fundamentally depends on high-quality annotated data, yet systematic approaches to developing annotation guidelines remain largely undocumented, especially for Arabic. We present a comprehensive methodology for trait-based Arabic AES annotation, applied to build a dataset of 7,859 essays by high school students annotated across seven writing traits, achieving substantial inter-annotator agreement (QWK: 0.66–0.75). Our methodology encompasses: (1) a seven-dimensional scoring framework grounded in Arabic linguistic and rhetorical conventions; (2) over 25 pages of Arabic-language guidelines with terminology unification, text-type-specific scoring descriptors, and annotated student examples; (3) a multi-stage training protocol that raised annotator agreement before production began; and (4) quality assurance mechanisms, including dual annotation and supervisor adjudication. We release all materials publicly, providing both a validated foundation for Arabic AES research and a replicable template for annotation guideline development in other morphologically complex, under-resourced languages.
Anthology ID:
2026.law-main.11
Volume:
Proceedings of the 20th Linguistic Annotation Workshop (LAW XX)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Yang Janet Liu, Luke Gessler
Venues:
LAW | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
146–157
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.law-main.11/
DOI:
Bibkey:
Cite (ACL):
Walid Massoud, Houda Bouamor, Abdelrahman Abdel Latif Hussein, and Abdullah Mohamed Mohamed Zekri. 2026. Designing Annotation Guidelines for Trait-Based Arabic Automated Essay Scoring: A Systematic Methodology. In Proceedings of the 20th Linguistic Annotation Workshop (LAW XX), pages 146–157, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Designing Annotation Guidelines for Trait-Based Arabic Automated Essay Scoring: A Systematic Methodology (Massoud et al., LAW 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.law-main.11.pdf