DLRG@TamilNLP-ACL2022: Offensive Span Identification in Tamil usingBiLSTM-CRF approach

Ratnavel Rajalakshmi, Mohit More, Bhamatipati Shrikriti, Gitansh Saharan, Hanchate Samyuktha, Sayantan Nandy


Abstract
Identifying offensive speech is an exciting andessential area of research, with ample tractionin recent times. This paper presents our sys-tem submission to the subtask 1, focusing onusing supervised approaches for extracting Of-fensive spans from code-mixed Tamil-Englishcomments. To identify offensive spans, wedeveloped the Bidirectional Long Short-TermMemory (BiLSTM) model with Glove Em-bedding. To this end, the developed systemachieved an overall F1 of 0.1728. Addition-ally, for comments with less than 30 characters,the developed system shows an F1 of 0.3890,competitive with other submissions.
Anthology ID:
2022.dravidianlangtech-1.38
Volume:
Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
DravidianLangTech
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
248–253
Language:
URL:
https://aclanthology.org/2022.dravidianlangtech-1.38
DOI:
10.18653/v1/2022.dravidianlangtech-1.38
Bibkey:
Cite (ACL):
Ratnavel Rajalakshmi, Mohit More, Bhamatipati Shrikriti, Gitansh Saharan, Hanchate Samyuktha, and Sayantan Nandy. 2022. DLRG@TamilNLP-ACL2022: Offensive Span Identification in Tamil usingBiLSTM-CRF approach. In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, pages 248–253, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
DLRG@TamilNLP-ACL2022: Offensive Span Identification in Tamil usingBiLSTM-CRF approach (Rajalakshmi et al., DravidianLangTech 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.dravidianlangtech-1.38.pdf