AbhiPaw@ DravidianLangTech: Fake News Detection in Dravidian Languages using Multilingual BERT

Abhinaba Bala, Parameswari Krishnamurthy


Abstract
This study addresses the challenge of detecting fake news in Dravidian languages by leveraging Google’s MuRIL (Multilingual Representations for Indian Languages) model. Drawing upon previous research, we investigate the intricacies involved in identifying fake news and explore the potential of transformer-based models for linguistic analysis and contextual understanding. Through supervised learning, we fine-tune the “muril-base-cased” variant of MuRIL using a carefully curated dataset of labeled comments and posts in Dravidian languages, enabling the model to discern between original and fake news. During the inference phase, the fine-tuned MuRIL model analyzes new textual content, extracting contextual and semantic features to predict the content’s classification. We evaluate the model’s performance using standard metrics, highlighting the effectiveness of MuRIL in detecting fake news in Dravidian languages and contributing to the establishment of a safer digital ecosystem. Keywords: fake news detection, Dravidian languages, MuRIL, transformer-based models, linguistic analysis, contextual understanding.
Anthology ID:
2023.dravidianlangtech-1.34
Volume:
Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Bharathi R. Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Sajeetha Thavareesan, Elizabeth Sherly
Venues:
DravidianLangTech | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
235–238
Language:
URL:
https://aclanthology.org/2023.dravidianlangtech-1.34
DOI:
Bibkey:
Cite (ACL):
Abhinaba Bala and Parameswari Krishnamurthy. 2023. AbhiPaw@ DravidianLangTech: Fake News Detection in Dravidian Languages using Multilingual BERT. In Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages, pages 235–238, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
AbhiPaw@ DravidianLangTech: Fake News Detection in Dravidian Languages using Multilingual BERT (Bala & Krishnamurthy, DravidianLangTech-WS 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2023.dravidianlangtech-1.34.pdf