Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words
Vivian Stamou, Iakovi Alexiou, Antigone Klimi, Eleftheria Molou, Alexandra Saivanidou, Stella Markantonatou
Abstract
We present a cleansed version of the multilingual lexicon HURTLEX-(EL) comprising 737 offensive words of Modern Greek. We worked bottom-up in two annotation rounds and developed detailed guidelines by cross-classifying words on three dimensions: context, reference, and thematic domain. Our classification reveals a wider spectrum of thematic domains concerning the study of offensive language than previously thought Efthymiou et al. (2014) and reveals social and cultural aspects that are not included in the HURTLEX categories.- Anthology ID:
- 2022.woah-1.10
- Volume:
- Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH)
- Month:
- July
- Year:
- 2022
- Address:
- Seattle, Washington (Hybrid)
- Venue:
- WOAH
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 102–108
- Language:
- URL:
- https://aclanthology.org/2022.woah-1.10
- DOI:
- 10.18653/v1/2022.woah-1.10
- Cite (ACL):
- Vivian Stamou, Iakovi Alexiou, Antigone Klimi, Eleftheria Molou, Alexandra Saivanidou, and Stella Markantonatou. 2022. Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words. In Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH), pages 102–108, Seattle, Washington (Hybrid). Association for Computational Linguistics.
- Cite (Informal):
- Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words (Stamou et al., WOAH 2022)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2022.woah-1.10.pdf