Simon @ DravidianLangTech-EACL2021: Detecting Offensive Content in Kannada Language

Qinyu Que


Abstract
This article introduces the system for the shared task of Offensive Language Identification in Dravidian Languages-EACL 2021. The world’s information technology develops at a high speed. People are used to expressing their views and opinions on social media. This leads to a lot of offensive language on social media. As people become more dependent on social media, the detection of offensive language becomes more and more necessary. This shared task is in three languages: Tamil, Malayalam, and Kannada. Our team takes part in the Kannada language task. To accomplish the task, we use the XLM-Roberta model for pre-training. But the capabilities of the XLM-Roberta model do not satisfy us in terms of statement information collection. So we made some tweaks to the output of this model. In this paper, we describe the models and experiments for accomplishing the task of the Kannada language.
Anthology ID:
2021.dravidianlangtech-1.20
Volume:
Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages
Month:
April
Year:
2021
Address:
Kyiv
Editors:
Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Parameswari Krishnamurthy, Elizabeth Sherly
Venue:
DravidianLangTech
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
160–163
Language:
URL:
https://aclanthology.org/2021.dravidianlangtech-1.20
DOI:
Bibkey:
Cite (ACL):
Qinyu Que. 2021. Simon @ DravidianLangTech-EACL2021: Detecting Offensive Content in Kannada Language. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, pages 160–163, Kyiv. Association for Computational Linguistics.
Cite (Informal):
Simon @ DravidianLangTech-EACL2021: Detecting Offensive Content in Kannada Language (Que, DravidianLangTech 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.dravidianlangtech-1.20.pdf
Software:
 2021.dravidianlangtech-1.20.Software.zip