Offence in Dialogues: A Corpus-Based Study

Johannes Schäfer, Ben Burtenshaw


Abstract
In recent years an increasing number of analyses of offensive language has been published, however, dealing mainly with the automatic detection and classification of isolated instances. In this paper we aim to understand the impact of offensive messages in online conversations diachronically, and in particular the change in offensiveness of dialogue turns. In turn, we aim to measure the progression of offence level as well as its direction - For example, whether a conversation is escalating or declining in offence. We present our method of extracting linear dialogues from tree-structured conversations in social media data and make our code publicly available. Furthermore, we discuss methods to analyse this dataset through changes in discourse offensiveness. Our paper includes two main contributions; first, using a neural network to measure the level of offensiveness in conversations; and second, the analysis of conversations around offensive comments using decoupling functions.
Anthology ID:
R19-1125
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
Month:
September
Year:
2019
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
1085–1093
Language:
URL:
https://aclanthology.org/R19-1125
DOI:
10.26615/978-954-452-056-4_125
Bibkey:
Cite (ACL):
Johannes Schäfer and Ben Burtenshaw. 2019. Offence in Dialogues: A Corpus-Based Study. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 1085–1093, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Offence in Dialogues: A Corpus-Based Study (Schäfer & Burtenshaw, RANLP 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-5/R19-1125.pdf