Web(er) of Hate: A Survey on How Hate Speech Is Typed

Luna Wang, Andrew Caines, Alice Hutchings


Abstract
The curation of hate speech datasets involves complex design decisions that balance competing priorities. This paper critically examines these methodological choices in a diverse range of datasets, highlighting common themes and practices, and their implications for dataset reliability. Drawing on Max Weber’s notion of ideal types, we argue for a reflexive approach in dataset creation, urging researchers to acknowledge their own value judgments during dataset construction, fostering transparency and methodological rigour.
Anthology ID:
2025.woah-1.9
Volume:
Proceedings of the The 9th Workshop on Online Abuse and Harms (WOAH)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Agostina Calabrese, Christine de Kock, Debora Nozza, Flor Miriam Plaza-del-Arco, Zeerak Talat, Francielle Vargas
Venues:
WOAH | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
77–103
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.woah-1.9/
DOI:
Bibkey:
Cite (ACL):
Luna Wang, Andrew Caines, and Alice Hutchings. 2025. Web(er) of Hate: A Survey on How Hate Speech Is Typed. In Proceedings of the The 9th Workshop on Online Abuse and Harms (WOAH), pages 77–103, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Web(er) of Hate: A Survey on How Hate Speech Is Typed (Wang et al., WOAH 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.woah-1.9.pdf
Supplementarymaterial:
 2025.woah-1.9.SupplementaryMaterial.zip