Abstract
We examine the task of detecting implicitly abusive comparisons (e.g. “Your hair looks like you have been electrocuted”). Implicitly abusive comparisons are abusive comparisons in which abusive words (e.g. “dumbass” or “scum”) are absent. We detail the process of creating a novel dataset for this task via crowdsourcing that includes several measures to obtain a sufficiently representative and unbiased set of comparisons. We also present classification experiments that include a range of linguistic features that help us better understand the mechanisms underlying abusive comparisons.- Anthology ID:
- 2021.eacl-main.27
- Volume:
- Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
- Month:
- April
- Year:
- 2021
- Address:
- Online
- Venue:
- EACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 358–368
- Language:
- URL:
- https://aclanthology.org/2021.eacl-main.27
- DOI:
- 10.18653/v1/2021.eacl-main.27
- Cite (ACL):
- Michael Wiegand, Maja Geulig, and Josef Ruppenhofer. 2021. Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 358–368, Online. Association for Computational Linguistics.
- Cite (Informal):
- Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis (Wiegand et al., EACL 2021)
- PDF:
- https://preview.aclanthology.org/starsem-semeval-split/2021.eacl-main.27.pdf
- Code
- miwieg/implicitly_abusive_comparisons