Abusive Language Recognition in Russian

Kamil Saitov, Leon Derczynski


Abstract
Abusive phenomena are commonplace in language on the web. The scope of recognizing abusive language is broad, covering many behaviors and forms of expression. This work addresses automatic detection of abusive language in Russian. The lexical, grammatical and morphological diversity of the Russian language presents potential difficulties for this task, which is addressed using a variety of machine learning approaches. Finally, competitive performance is reached over multiple domains.
Anthology ID:
2021.bsnlp-1.3
Volume:
Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing
Month:
April
Year:
2021
Address:
Kyiv, Ukraine
Venue:
BSNLP
SIG:
SIGSLAV
Publisher:
Association for Computational Linguistics
Pages:
20–25
URL:
https://aclanthology.org/2021.bsnlp-1.3
Cite (ACL):
Kamil Saitov and Leon Derczynski. 2021. Abusive Language Recognition in Russian. In Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pages 20–25, Kyiv, Ukraine. Association for Computational Linguistics.
Cite (Informal):
Abusive Language Recognition in Russian (Saitov & Derczynski, BSNLP 2021)
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.bsnlp-1.3.pdf
Code:
sariellee/russan-hate-speech-recognition
Data:
IPM NEL