Improving Hate Speech Detection by Fusing Textual and User Interaction Representations in Online Communities

Xu Gao, Dong Jing, Kee-hung Lai


Abstract
Detecting hate speech in online communities is increasingly challenging due to the implicit and context-dependent nature of toxic expressions. While text-only models often struggle with such ambiguity, incorporating user interaction signals offers critical pragmatic context for disambiguation. However, research in this direction is hindered by the scarcity of datasets that align textual content with comprehensive user behavioral graphs. To bridge this gap, we present a new dataset collected from a real-world community, featuring labeled hate speech enriched with fine-grained interaction histories. We further propose a novel user-aware hate speech detection framework that effectively fuses textual semantics with social interaction representations. Experiments demonstrate that our approach consistently outperforms strong text-only baselines by over 3.6%, validating the critical role of social context in enhancing detection accuracy. Furthermore, to mitigate real-world adversarial risks such as graph spoofing and spam, we introduce a contrastive graph augmentation strategy, ensuring model robustness against unreliable community behaviors.
Anthology ID:
2026.acl-industry.6
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Yunyao Li, Georg Rehm, Mei Tu
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
76–87
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.6/
DOI:
Bibkey:
Cite (ACL):
Xu Gao, Dong Jing, and Kee-hung Lai. 2026. Improving Hate Speech Detection by Fusing Textual and User Interaction Representations in Online Communities. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 76–87, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Improving Hate Speech Detection by Fusing Textual and User Interaction Representations in Online Communities (Gao et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.6.pdf