Impact of Politically Biased Data on Hate Speech Classification

Maximilian Wich, Jan Bauer, Georg Groh


Abstract
One challenge that social media platforms are facing nowadays is hate speech. Hence, automatic hate speech detection has been increasingly researched in recent years - in particular with the rise of deep learning. A problem of these models is their vulnerability to undesirable bias in training data. We investigate the impact of political bias on hate speech classification by constructing three politically-biased data sets (left-wing, right-wing, politically neutral) and compare the performance of classifiers trained on them. We show that (1) political bias negatively impairs the performance of hate speech classifiers and (2) an explainable machine learning model can help to visualize such bias within the training data. The results show that political bias in training data has an impact on hate speech classification and can become a serious issue.
Anthology ID:
2020.alw-1.7
Volume:
Proceedings of the Fourth Workshop on Online Abuse and Harms
Month:
November
Year:
2020
Address:
Online
Editors:
Seyi Akiwowo, Bertie Vidgen, Vinodkumar Prabhakaran, Zeerak Waseem
Venue:
ALW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
54–64
Language:
URL:
https://aclanthology.org/2020.alw-1.7
DOI:
10.18653/v1/2020.alw-1.7
Bibkey:
Cite (ACL):
Maximilian Wich, Jan Bauer, and Georg Groh. 2020. Impact of Politically Biased Data on Hate Speech Classification. In Proceedings of the Fourth Workshop on Online Abuse and Harms, pages 54–64, Online. Association for Computational Linguistics.
Cite (Informal):
Impact of Politically Biased Data on Hate Speech Classification (Wich et al., ALW 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/2020.alw-1.7.pdf
Video:
 https://slideslive.com/38939519
Code
 mawic/political-bias-hate-speech