JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges

Avishek Garain; Atanu Mandal; Sudip Kumar Naskar

JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges

Avishek Garain, Atanu Mandal, Sudip Kumar Naskar

Abstract

Offensive language identification has been an active area of research in natural language processing. With the emergence of multiple social media platforms offensive language identification has emerged as a need of the hour. Traditional offensive language identification models fail to deliver acceptable results as social media contents are largely in multilingual and are code-mixed in nature. This paper tries to resolve this problem by using IndicBERT and BERT architectures, to facilitate identification of offensive languages for Kannada-English, Malayalam-English, and Tamil-English code-mixed language pairs extracted from social media. The presented approach when evaluated on the test corpus provided precision, recall, and F1 score for language pair Kannada-English as 0.62, 0.71, and 0.66, respectively, for language pair Malayalam-English as 0.77, 0.43, and 0.53, respectively, and for Tamil-English as 0.71, 0.74, and 0.72, respectively.

Anthology ID:: 2021.dravidianlangtech-1.46
Volume:: Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages
Month:: April
Year:: 2021
Address:: Kyiv
Editors:: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Parameswari Krishnamurthy, Elizabeth Sherly
Venue:: DravidianLangTech
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 319–322
Language:
URL:: https://preview.aclanthology.org/add-emnlp-2024-awards/2021.dravidianlangtech-1.46/
DOI:
Bibkey:
Cite (ACL):: Avishek Garain, Atanu Mandal, and Sudip Kumar Naskar. 2021. JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, pages 319–322, Kyiv. Association for Computational Linguistics.
Cite (Informal):: JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges (Garain et al., DravidianLangTech 2021)
Copy Citation:
PDF:: https://preview.aclanthology.org/add-emnlp-2024-awards/2021.dravidianlangtech-1.46.pdf

PDF Cite Search Fix data