Abstract
Abusive comments in online platforms have become a significant concern, necessitating the development of effective detection systems. However, limited work has been done in low resource languages, including Dravidian languages. This paper addresses this gap by focusing on abusive comment detection in a dataset containing Tamil, Tamil-English and Telugu-English code-mixed comments. Our methodology involves logistic regression and explores suitable embeddings to enhance the performance of the detection model. Through rigorous experimentation, we identify the most effective combination of logistic regression and embeddings. The results demonstrate the performance of our proposed model, which contributes to the development of robust abusive comment detection systems in low resource language settings. Keywords: Abusive comment detection, Dravidian languages, logistic regression, embeddings, low resource languages, code-mixed dataset.- Anthology ID:
- 2023.dravidianlangtech-1.33
- Volume:
- Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages
- Month:
- September
- Year:
- 2023
- Address:
- Varna, Bulgaria
- Editors:
- Bharathi R. Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Sajeetha Thavareesan, Elizabeth Sherly
- Venues:
- DravidianLangTech | WS
- SIG:
- Publisher:
- INCOMA Ltd., Shoumen, Bulgaria
- Note:
- Pages:
- 231–234
- Language:
- URL:
- https://aclanthology.org/2023.dravidianlangtech-1.33
- DOI:
- Cite (ACL):
- Abhinaba Bala and Parameswari Krishnamurthy. 2023. AbhiPaw@ DravidianLangTech: Abusive Comment Detection in Tamil and Telugu using Logistic Regression. In Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages, pages 231–234, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
- Cite (Informal):
- AbhiPaw@ DravidianLangTech: Abusive Comment Detection in Tamil and Telugu using Logistic Regression (Bala & Krishnamurthy, DravidianLangTech-WS 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-1/2023.dravidianlangtech-1.33.pdf