AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection

Yejin Lee, Joonghyuk Hahn, Hyeseon Ahn, Yo-Sub Han


Abstract
Implicit hate speech detection is challenging due to its subtlety and reliance on contextual interpretation rather than explicit offensive words. Current approaches rely on contrastive learning, which are shown to be effective on distinguishing hate and non-hate sentences. Humans, however, detect implicit hate speech by first identifying specific targets within the text and subsequently interpreting how these target relate to their surrounding context. Motivated by this reasoning process, we propose AmpleHate, a novel approach designed to mirror human inference for implicit hate detection. AmpleHate identifies explicit target using a pretrained Named Entity Recognition model and capture implicit target information via [CLS] tokens. It computes attention-based relationships between explicit, implicit targets and sentence context and then, directly injects these relational vectors into the final sentence representation. This amplifies the critical signals of target-context relations for determining implicit hate. Experiments demonstrate that AmpleHate achieves state-of-the-art performance, outperforming contrastive learning baselines by an average of 82.14% and achieve faster convergence. Qualitative analyses further reveal that attention patterns produced by AmpleHate closely align with human judgement, underscoring its interpretability and robustness.
Anthology ID:
2025.emnlp-main.1469
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
28850–28862
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1469/
DOI:
Bibkey:
Cite (ACL):
Yejin Lee, Joonghyuk Hahn, Hyeseon Ahn, and Yo-Sub Han. 2025. AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 28850–28862, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection (Lee et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1469.pdf
Checklist:
 2025.emnlp-main.1469.checklist.pdf