Dimensions of Abusive Language on Twitter

Isobelle Clarke, Jack Grieve


Abstract
In this paper, we use a new categorical form of multidimensional register analysis to identify the main dimensions of functional linguistic variation in a corpus of abusive language, consisting of racist and sexist Tweets. By analysing the use of a wide variety of parts-of-speech and grammatical constructions, as well as various features related to Twitter and computer-mediated communication, we discover three dimensions of linguistic variation in this corpus, which we interpret as being related to the degree of interactive, antagonistic and attitudinal language exhibited by individual Tweets. We then demonstrate that there is a significant functional difference between racist and sexist Tweets, with sexists Tweets tending to be more interactive and attitudinal than racist Tweets.
Anthology ID:
W17-3001
Volume:
Proceedings of the First Workshop on Abusive Language Online
Month:
August
Year:
2017
Address:
Vancouver, BC, Canada
Venue:
ALW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1–10
Language:
URL:
https://aclanthology.org/W17-3001
DOI:
10.18653/v1/W17-3001
Bibkey:
Cite (ACL):
Isobelle Clarke and Jack Grieve. 2017. Dimensions of Abusive Language on Twitter. In Proceedings of the First Workshop on Abusive Language Online, pages 1–10, Vancouver, BC, Canada. Association for Computational Linguistics.
Cite (Informal):
Dimensions of Abusive Language on Twitter (Clarke & Grieve, ALW 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/W17-3001.pdf