AyahVerse at AbjadGenEval Shared Task: Monolingual Precision and Cross-Lingual Analysis in Perso-Arabic AI Detection
Fizza Nawaz, Ibad-ur-Rehman Rashid, Uswa Abid, Junaid Hussain
Abstract
This paper presents our submission to the AbjadGenEval shared task on AI-generated text detection in Arabic and Urdu. To address the challenges of morphologically rich and low-resource environments, we developed a composite framework leveraging monolingual specialists (AraBERTv2, CAMeLBERT-DA) and multilingual transformers. Our system achieved robust in-domain performance with Test F1-scores of 0.75 for Arabic and 0.86 for Urdu. Methodologically, we tested both raw and normalized text to distinguish whether models detect based on semantic content or on surface artifacts such as punctuation and formatting patterns. Furthermore, our cross-lingual investigations reveal directional performance differences, where Urdu-trained models achieve 0.75 F1 on Arabic, while Arabic-trained models achieve only 0.61 F1 on Urdu. Despite this difference, both directions maintained notably high recall for the machine class, indicating that the model learns cross-lingual machine detection patterns across the Perso-Arabic script. Finally, transfer performance collapsed when internal layers were frozen, demonstrating that full fine-tuning is essential for cross-lingual detection. However, the observed performance differences may partly reflect data imbalance rather than purely linguistic factors.
- Anthology ID: 2026.abjadnlp-1.63
- Volume: Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
- Month: March
- Year: 2026
- Address: Rabat, Morocco
- Venues: AbjadNLP | WS
- Publisher: Association for Computational Linguistics
- Pages: 497–505
- URL: https://preview.aclanthology.org/manual-author-scripts/2026.abjadnlp-1.63/
- Cite (ACL): Fizza Nawaz, Ibad-ur-Rehman Rashid, Uswa Abid, and Junaid Hussain. 2026. AyahVerse at AbjadGenEval Shared Task: Monolingual Precision and Cross-Lingual Analysis in Perso-Arabic AI Detection. In Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script, pages 497–505, Rabat, Morocco. Association for Computational Linguistics.
- Cite (Informal): AyahVerse at AbjadGenEval Shared Task: Monolingual Precision and Cross-Lingual Analysis in Perso-Arabic AI Detection (Nawaz et al., AbjadNLP 2026)
- PDF: https://preview.aclanthology.org/manual-author-scripts/2026.abjadnlp-1.63.pdf