Contextual-Lexicon Approach for Abusive Language Detection

Francielle Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto, Thiago Pardo


Abstract
Since a lexicon-based approach is more elegant scientifically, explaining the solution components and being easier to generalize to other applications, this paper provides a new approach for offensive language and hate speech detection on social media, which embodies a lexicon of implicit and explicit offensive and swearing expressions annotated with contextual information. Due to the severity of the social media abusive comments in Brazil, and the lack of research in Portuguese, Brazilian Portuguese is the language used to validate the models. Nevertheless, our method may be applied to any other language. The conducted experiments show the effectiveness of the proposed approach, outperforming the current baseline methods for the Portuguese language.
Anthology ID:
2021.ranlp-1.161
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
1438–1447
Language:
URL:
https://aclanthology.org/2021.ranlp-1.161
DOI:
Bibkey:
Cite (ACL):
Francielle Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto, and Thiago Pardo. 2021. Contextual-Lexicon Approach for Abusive Language Detection. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 1438–1447, Held Online. INCOMA Ltd..
Cite (Informal):
Contextual-Lexicon Approach for Abusive Language Detection (Vargas et al., RANLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-3/2021.ranlp-1.161.pdf