Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying

Rachele Sprugnoli, Stefano Menini, Sara Tonelli, Filippo Oncini, Enrico Piras


Abstract
Although WhatsApp is used by teenagers as one major channel of cyberbullying, such interactions remain invisible due to the app privacy policies that do not allow ex-post data collection. Indeed, most of the information on these phenomena rely on surveys regarding self-reported data. In order to overcome this limitation, we describe in this paper the activities that led to the creation of a WhatsApp dataset to study cyberbullying among Italian students aged 12-13. We present not only the collected chats with annotations about user role and type of offense, but also the living lab created in a collaboration between researchers and schools to monitor and analyse cyberbullying. Finally, we discuss some open issues, dealing with ethical, operational and epistemic aspects.
Anthology ID:
W18-5107
Volume:
Proceedings of the 2nd Workshop on Abusive Language Online (ALW2)
Month:
October
Year:
2018
Address:
Brussels, Belgium
Venue:
ALW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
51–59
Language:
URL:
https://aclanthology.org/W18-5107
DOI:
10.18653/v1/W18-5107
Bibkey:
Cite (ACL):
Rachele Sprugnoli, Stefano Menini, Sara Tonelli, Filippo Oncini, and Enrico Piras. 2018. Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying. In Proceedings of the 2nd Workshop on Abusive Language Online (ALW2), pages 51–59, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Creating a WhatsApp Dataset to Study Pre-teen Cyberbullying (Sprugnoli et al., ALW 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/W18-5107.pdf