Beyond Annotator Disagreement: Guideline-Induced Errors in Arabic Hate Speech Annotation

Wajdi Zaghouani

Beyond Annotator Disagreement: Guideline-Induced Errors in Arabic Hate Speech Annotation

Abstract

Annotation errors in hate speech corpora are often attributed to annotator disagreement or bias. This paper argues that a substantial and underexamined class of errors originates upstream, from structural weaknesses in annotation guidelines themselves. When guidelines fail to encode the linguistic and cultural properties of the target discourse, they make certain errors structurally inevitable regardless of annotator quality. Focusing on Arabic social media discourse, a challenging setting due to its dialect continuum, culturally embedded insult conventions, sarcasm-heavy pragmatics, and complex religious rhetoric, we identify three mechanisms through which guideline design produces systematic annotation errors: cultural misclassification, when culturally specific hostile expressions fall outside annotation categories; dialectal ambiguity, when lexical meanings shift across regional varieties; and annotation projection, when frameworks developed for English moderation are applied to Arabic without adequate adaptation. Using six illustrative case studies with attested Arabic examples, we show how these mechanisms produce recurrent misannotations in existing datasets. We propose a taxonomy of five guideline-induced error types, an explicit mapping from mechanisms to error types, and a practical four-stage diagnostic framework for dataset builders.

Anthology ID:: 2026.law-main.5
Volume:: Proceedings of the 20th Linguistic Annotation Workshop (LAW XX)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yang Janet Liu, Luke Gessler
Venues:: LAW | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 47–58
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.law-main.5/
DOI:
Bibkey:
Cite (ACL):: Wajdi Zaghouani. 2026. Beyond Annotator Disagreement: Guideline-Induced Errors in Arabic Hate Speech Annotation. In Proceedings of the 20th Linguistic Annotation Workshop (LAW XX), pages 47–58, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Beyond Annotator Disagreement: Guideline-Induced Errors in Arabic Hate Speech Annotation (Zaghouani, LAW 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.law-main.5.pdf

PDF Cite Search Fix data