indiDataMiner at SemEval-2025 Task 11: From Text to Emotion: Transformer-Based Models for Emotions Detection in Indian Languages

Saurabh Kumar, Sujit Kumar, Sanasam Ranbir Singh, Sukumar Nandi


Abstract
Emotion detection is essential for applications like mental health monitoring and social media analysis, yet remains underexplored for Indian languages. This paper presents our system for SemEval-2025 Task 11 (Track A), focusing on multilabel emotion detection in Hindi and Marathi, two widely spoken Indian languages. We fine-tune IndicBERT v2 on the BRIGHTER dataset, achieving F1 scores of 87.37 (Hindi) and 88.32 (Marathi), outperforming baseline models. Our results highlight the effectiveness of fine-tuning a language-specific pretrained model for emotion detection, contributing to advancements in multilingual NLP research.
Anthology ID:
2025.semeval-1.262
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2020–2027
Language:
URL:
https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.262/
DOI:
Bibkey:
Cite (ACL):
Saurabh Kumar, Sujit Kumar, Sanasam Ranbir Singh, and Sukumar Nandi. 2025. indiDataMiner at SemEval-2025 Task 11: From Text to Emotion: Transformer-Based Models for Emotions Detection in Indian Languages. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 2020–2027, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
indiDataMiner at SemEval-2025 Task 11: From Text to Emotion: Transformer-Based Models for Emotions Detection in Indian Languages (Kumar et al., SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.262.pdf