Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection

Jaehoon Kim; Seungwan Jin; Sohyun Park; Someen Park; Kyungsik Han

doi:10.18653/v1/2024.findings-acl.957

Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection

Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, Kyungsik Han

Abstract

Detecting implicit hate speech that is not directly hateful remains a challenge. Recent research has attempted to detect implicit hate speech by applying contrastive learning to pre-trained language models such as BERT and RoBERTa, but the proposed models still do not have a significant advantage over cross-entropy loss-based learning. We found that contrastive learning based on randomly sampled batch data does not encourage the model to learn hard negative samples. In this work, we propose Label-aware Hard Negative sampling strategies (LAHN) that encourage the model to learn detailed features from hard negative samples, instead of naive negative samples in random batch, using momentum-integrated contrastive learning. LAHN outperforms the existing models for implicit hate speech detection both in- and cross-datasets. The code is available at https://github.com/Hanyang-HCC-Lab/LAHN

Anthology ID:: 2024.findings-acl.957
Volume:: Findings of the Association for Computational Linguistics: ACL 2024
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16177–16188
Language:
URL:: https://aclanthology.org/2024.findings-acl.957
DOI:: 10.18653/v1/2024.findings-acl.957
Bibkey:
Cite (ACL):: Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, and Kyungsik Han. 2024. Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection. In Findings of the Association for Computational Linguistics: ACL 2024, pages 16177–16188, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection (Kim et al., Findings 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/autopr/2024.findings-acl.957.pdf

PDF Search Fix metadata