Abstract
Offensive language identification has been an active area of research in natural language processing. With the emergence of multiple social media platforms offensive language identification has emerged as a need of the hour. Traditional offensive language identification models fail to deliver acceptable results as social media contents are largely in multilingual and are code-mixed in nature. This paper tries to resolve this problem by using IndicBERT and BERT architectures, to facilitate identification of offensive languages for Kannada-English, Malayalam-English, and Tamil-English code-mixed language pairs extracted from social media. The presented approach when evaluated on the test corpus provided precision, recall, and F1 score for language pair Kannada-English as 0.62, 0.71, and 0.66, respectively, for language pair Malayalam-English as 0.77, 0.43, and 0.53, respectively, and for Tamil-English as 0.71, 0.74, and 0.72, respectively.- Anthology ID:
- 2021.dravidianlangtech-1.46
- Volume:
- Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages
- Month:
- April
- Year:
- 2021
- Address:
- Kyiv
- Editors:
- Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Parameswari Krishnamurthy, Elizabeth Sherly
- Venue:
- DravidianLangTech
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 319–322
- Language:
- URL:
- https://aclanthology.org/2021.dravidianlangtech-1.46
- DOI:
- Cite (ACL):
- Avishek Garain, Atanu Mandal, and Sudip Kumar Naskar. 2021. JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, pages 319–322, Kyiv. Association for Computational Linguistics.
- Cite (Informal):
- JUNLP@DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Langauges (Garain et al., DravidianLangTech 2021)
- PDF:
- https://preview.aclanthology.org/emnlp-22-attachments/2021.dravidianlangtech-1.46.pdf