REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection

Mariam Labib; Nsrin Ashraf; Ahmed M. Fetouh; Hamada Nayel

doi:10.18653/v1/2026.abjadnlp-1.62

REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection

Mariam Labib, Nsrin Ashraf, Ahmed M. Fetouh, Hamada Nayel

Abstract

The rapid advancement of large language models necessitates robust methods for detecting AI-generated Arabic text. This paper presents our system for distinguishing human-written from machine-generated Arabic content. We propose a weighted ensemble combining AraBERTv2 and BERT-base-arabic, trained via 5-fold stratified cross-validation with class-balanced loss functions. Our methodology incorporates Arabic text normalization, strategic data augmentation using 16,678 samples from external scientific abstracts, and threshold optimization prioritizing recall. On the official test set, our system achieved an F1-score of 0.763, an accuracy of 0.695, a precision of 0.624, and a recall of 0.980, demonstrating strong detection of machine-generated texts with minimal false negatives at the cost of elevated false positives. Analysis reveals critical insights into precision-recall trade-offs and challenges in cross-domain generalization for Arabic AI text detection.

Anthology ID:: 2026.abjadnlp-1.62
Volume:: Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Mo El-Haj, Paul Rayson, Mustafa Jarrar, Ignatius Ezeani, Saad Ezzini, Sina Ahmadi, Amal Haddad Haddad, Cynthia Amol, Ahmad Abdelali, Shadi Abudalfa
Venues:: AbjadNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 493–496
Language:
URL:: https://preview.aclanthology.org/ingest-nejlt/2026.abjadnlp-1.62/
DOI:: 10.18653/v1/2026.abjadnlp-1.62
Bibkey:
Cite (ACL):: Mariam Labib, Nsrin Ashraf, Ahmed M. Fetouh, and Hamada Nayel. 2026. REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection. In Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script, pages 493–496, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection (Labib et al., AbjadNLP 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-nejlt/2026.abjadnlp-1.62.pdf

PDF Cite Search Fix data