RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection

Yejin Lee; Hyeseon An; Yo-Sub Han

RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection

Abstract

Hate speech remains prevalent in human society and continues to evolve in its forms and expressions. Modern advancements in the internet and online anonymity accelerate its rapid spread and complicate its detection. However, hate speech datasets exhibit diverse characteristics primarily because they are constructed from different sources and platforms, each reflecting different linguistic styles and social contexts. Despite this diversity, prior studies on hate speech detection often rely on fixed methodologies without adapting to data-specific features. We introduce RV-HATE, a detection framework designed to account for the dataset-specific characteristics of each hate speech dataset. RV-HATE consists of multiple specialized modules, where each module focuses on distinct linguistic or contextual features of hate speech. The framework employs reinforcement learning to optimize weights that determine the contribution of each module for a given dataset. A voting mechanism then aggregates the module outputs to produce the final decision. RV-HATE offers two primary advantages: (1) it improves detection accuracy by tailoring the detection process to dataset-specific attributes, and (2) it also provides interpretable insights into the distinctive features of each dataset. Consequently, our approach effectively addresses implicit hate speech and achieves superior performance compared to conventional static methods.

Anthology ID:: 2026.acl-long.2104
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 45364–45383
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2104/
DOI:
Bibkey:
Cite (ACL):: Yejin Lee, Hyeseon An, and Yo-Sub Han. 2026. RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 45364–45383, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech Detection (Lee et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.2104.pdf
Checklist:: 2026.acl-long.2104.checklist.pdf

PDF Cite Search Checklist Fix data