Investigating Annotator Bias with a Graph-Based Approach

Maximilian Wich, Hala Al Kuwatly, Georg Groh


Abstract
A challenge that many online platforms face is hate speech or any other form of online abuse. To cope with this, hate speech detection systems are developed based on machine learning to reduce manual work for monitoring these platforms. Unfortunately, machine learning is vulnerable to unintended bias in training data, which could have severe consequences, such as a decrease in classification performance or unfair behavior (e.g., discriminating minorities). In the scope of this study, we want to investigate annotator bias — a form of bias that annotators cause due to different knowledge in regards to the task and their subjective perception. Our goal is to identify annotation bias based on similarities in the annotation behavior from annotators. To do so, we build a graph based on the annotations from the different annotators, apply a community detection algorithm to group the annotators, and train for each group classifiers whose performances we compare. By doing so, we are able to identify annotator bias within a data set. The proposed method and collected insights can contribute to developing fairer and more reliable hate speech classification models.
Anthology ID:
2020.alw-1.22
Original:
2020.alw-1.22v1
Version 2:
2020.alw-1.22v2
Volume:
Proceedings of the Fourth Workshop on Online Abuse and Harms
Month:
November
Year:
2020
Address:
Online
Venue:
ALW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
191–199
Language:
URL:
https://aclanthology.org/2020.alw-1.22
DOI:
10.18653/v1/2020.alw-1.22
Bibkey:
Cite (ACL):
Maximilian Wich, Hala Al Kuwatly, and Georg Groh. 2020. Investigating Annotator Bias with a Graph-Based Approach. In Proceedings of the Fourth Workshop on Online Abuse and Harms, pages 191–199, Online. Association for Computational Linguistics.
Cite (Informal):
Investigating Annotator Bias with a Graph-Based Approach (Wich et al., ALW 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.alw-1.22.pdf
Optional supplementary material:
 2020.alw-1.22.OptionalSupplementaryMaterial.zip
Video:
 https://slideslive.com/38939539
Code
 mawic/graph-based-method-annotator-bias