A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets

R.n. Yadawad; Sunil Saumya; K.n. Nivedh; Siddhaling S. Padanur; Sudev Basti

Abstract

This paper presents a novel system developed for the Faux-Hate Shared Task at ICON2024, addressing the detection of hate speech and fake narratives within Hindi-English code-mixed social media data. Our approach combines advanced text preprocessing, TF-IDF vectorization, and Random Forest classifiers to identify harmful content, while employing SMOTE to address class imbalance. By leveraging ensemble learning and feature engineering, our system demonstrates robust performance in detecting hateful and fake content, classifying targets, and evaluating the severity of hate speech. The results underscore the potential for real-world applications, such as moderating online platforms and identifying harmful narratives. Furthermore, we highlight ethical considerations for deploying such tools, emphasizing responsible use in sensitive domains, thereby advancing research in multilingual hate speech detection and online abuse mitigation.
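As a rough illustration of two steps named in the abstract, the sketch below builds TF-IDF vectors for a handful of toy code-mixed sentences and then balances the classes with SMOTE-style interpolation between minority-class neighbours. It is not the authors' implementation: the toy sentences, the labels, the function names `tfidf_vectors` and `smote_oversample`, and the parameters `k` and `seed` are all assumptions made for this example, and the Random Forest training step is left out. In practice one would more likely rely on library implementations such as scikit-learn's `TfidfVectorizer` and imbalanced-learn's `SMOTE`.

```python
import math
import random
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, dense TF-IDF vectors) for whitespace-tokenised docs."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    n_docs = len(docs)
    # Document frequency: in how many docs each token appears.
    df = Counter(tok for toks in tokenised for tok in set(toks))
    # Smoothed inverse document frequency, always positive.
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}
    vectors = []
    for toks in tokenised:
        counts = Counter(toks)
        total = len(toks) or 1  # guard against empty documents
        vectors.append([counts[tok] / total * idf[tok] for tok in vocab])
    return vocab, vectors


def smote_oversample(vectors, labels, minority, k=2, seed=0):
    """Add synthetic minority samples until the minority matches the largest class.

    Each synthetic vector is a random point on the line segment between a
    minority sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    minority_vecs = [v for v, y in zip(vectors, labels) if y == minority]
    n_needed = max(Counter(labels).values()) - len(minority_vecs)
    new_vecs, new_labels = list(vectors), list(labels)
    if len(minority_vecs) < 2 or n_needed <= 0:
        return new_vecs, new_labels  # nothing to interpolate or already balanced
    for _ in range(n_needed):
        base = rng.choice(minority_vecs)
        neighbours = sorted(
            (v for v in minority_vecs if v is not base),
            key=lambda v: math.dist(base, v),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        new_vecs.append([b + gap * (o - b) for b, o in zip(base, other)])
        new_labels.append(minority)
    return new_vecs, new_labels


# Toy, invented examples (not from the shared-task data); 1 = fake, 0 = not fake.
docs = [
    "yeh news bilkul fake hai",
    "aaj ka match bahut accha tha",
    "weather is nice aaj",
    "shaam ko chai peete hain",
    "kal office jaana hai",
    "yeh khabar fake lagti hai",
]
labels = [1, 0, 0, 0, 0, 1]

vocab, vectors = tfidf_vectors(docs)
balanced_vecs, balanced_labels = smote_oversample(vectors, labels, minority=1)
print(Counter(labels), "->", Counter(balanced_labels))
```

The balanced vectors and labels could then be passed to any classifier. Oversampling should be applied to the training split only, so that synthetic points do not leak into evaluation data.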

Anthology ID: 2024.icon-fauxhate.8
Volume: Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate)
Month: December
Year: 2024
Address: AU-KBC Research Centre, Chennai, India
Editors: Shankar Biradar, Kasu Sai Kartheek Reddy, Sunil Saumya, Md. Shad Akhtar
Venue: ICON
SIG: SIGLEX
Publisher: NLP Association of India (NLPAI)
Pages: 40–44
URL: https://preview.aclanthology.org/fix-sig-urls/2024.icon-fauxhate.8/
Cite (ACL): R.n. Yadawad, Sunil Saumya, K.n. Nivedh, Siddhaling S. Padanur, and Sudev Basti. 2024. A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets. In Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate), pages 40–44, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal): A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets (Yadawad et al., ICON 2024)
PDF: https://preview.aclanthology.org/fix-sig-urls/2024.icon-fauxhate.8.pdf
