Abstract
Abusive phenomena are commonplace in language on the web. The scope of recognizing abusive language is broad, covering many behaviors and forms of expression. This work addresses automatic detection of abusive language in Russian. The lexical, grammatical and morphological diversity of Russian language present potential difficulties for this task, which is addressed using a variety of machine learning approaches. Finally, competitive performance is reached over multiple domains for this investigation into automatic detection of abusive language in Russian.- Anthology ID:
- 2021.bsnlp-1.3
- Volume:
- Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing
- Month:
- April
- Year:
- 2021
- Address:
- Kiyv, Ukraine
- Editors:
- Bogdan Babych, Olga Kanishcheva, Preslav Nakov, Jakub Piskorski, Lidia Pivovarova, Vasyl Starko, Josef Steinberger, Roman Yangarber, Michał Marcińczuk, Senja Pollak, Pavel Přibáň, Marko Robnik-Šikonja
- Venue:
- BSNLP
- SIG:
- SIGSLAV
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 20–25
- Language:
- URL:
- https://aclanthology.org/2021.bsnlp-1.3
- DOI:
- Cite (ACL):
- Kamil Saitov and Leon Derczynski. 2021. Abusive Language Recognition in Russian. In Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing, pages 20–25, Kiyv, Ukraine. Association for Computational Linguistics.
- Cite (Informal):
- Abusive Language Recognition in Russian (Saitov & Derczynski, BSNLP 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2021.bsnlp-1.3.pdf
- Code
- sariellee/russan-hate-speech-recognition
- Data
- IPM NEL