A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets
R.n. Yadawad, Sunil Saumya, K.n. Nivedh, Siddhaling S. Padanur, Sudev Basti
Abstract
This paper presents a novel system developed for the Faux-Hate Shared Task at ICON2024, addressing the detection of hate speechand fake narratives within Hindi-English code-mixed social media data. Our approach com-bines advanced text preprocessing, TF-IDFvectorization, and Random Forest classifiersto identify harmful content, while employingSMOTE to address class imbalance. By lever-aging ensemble learning and feature engineer-ing, our system demonstrates robust perfor-mance in detecting hateful and fake content,classifying targets, and evaluating the sever-ity of hate speech. The results underscore thepotential for real-world applications, such asmoderating online platforms and identifyingharmful narratives. Furthermore, we highlightethical considerations for deploying such tools,emphasizing responsible use in sensitive do-mains, thereby advancing research in multilin-gual hate speech detection and online abusemitigation.- Anthology ID:
- 2024.icon-fauxhate.8
- Volume:
- Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate)
- Month:
- December
- Year:
- 2024
- Address:
- AU-KBC Research Centre, Chennai, India
- Editors:
- Shankar Biradar, Kasu Sai Kartheek Reddy, Sunil Saumya, Md. Shad Akhtar
- Venue:
- ICON
- SIG:
- SIGLEX
- Publisher:
- NLP Association of India (NLPAI)
- Note:
- Pages:
- 40–44
- Language:
- URL:
- https://preview.aclanthology.org/fix-sig-urls/2024.icon-fauxhate.8/
- DOI:
- Cite (ACL):
- R.n. Yadawad, Sunil Saumya, K.n. Nivedh, Siddhaling S. Padanur, and Sudev Basti. 2024. A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets. In Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate), pages 40–44, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
- Cite (Informal):
- A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets (Yadawad et al., ICON 2024)
- PDF:
- https://preview.aclanthology.org/fix-sig-urls/2024.icon-fauxhate.8.pdf