TamilEcho_Political@DravidianLangTech 2026: Hybrid XLM-RoBERTa with Sarcasm-Aware Feature Fusion for Political Multiclass Sentiment Analysis in Tamil X
Kanimozhi Selvi C S, Inigashree N S, Kavinraj J, Moneissh A G
Abstract
Political sentiment analysis in Tamil social media is challenging due to informal language, sarcasm, emoji-driven sentiment inversion, and severe class imbalance. This paper presents TamilEcho, our system submitted to the Shared Task on Political Multiclass Sentiment Analysis of Tamil X (Twitter) Comments at DravidianLangTech@ACL 2026. We propose a hybrid architecture that integrates contextual representations from XLM-RoBERTa with lexical TF-IDF features and explicit sarcasm-aware emoji features. Domain-specific hashtag expansion is incorporated to enrich political context. To address class imbalance, we apply inverse-frequency class weighting and label smoothing during training. Experimental results demonstrate that hybrid feature fusion significantly improves performance over transformer-only baselines. Our final system achieves a Macro-F1 score of 0.3559 on the official test set, securing Rank 10 among participating teams. The results highlight the effectiveness of combining semantic, lexical, and pragmatic cues for fine-grained political sentiment classification in Tamil.- Anthology ID:
- 2026.dravidianlangtech-1.61
- Volume:
- Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages
- Month:
- July
- Year:
- 2026
- Address:
- Underline (Virtual)
- Editors:
- Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Sajeetha Thavareesan, Saranya Rajiakodi, Subalalitha Navaneethakrishnan, Dhivya Chinnappa, Balasubramanian Palani, Malliga Subramanian, Kogilavani Shanmugavadivel, Ratnavel Rajalakshmi
- Venues:
- DravidianLangTech | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 387–391
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.dravidianlangtech-1.61/
- DOI:
- Cite (ACL):
- Kanimozhi Selvi C S, Inigashree N S, Kavinraj J, and Moneissh A G. 2026. TamilEcho_Political@DravidianLangTech 2026: Hybrid XLM-RoBERTa with Sarcasm-Aware Feature Fusion for Political Multiclass Sentiment Analysis in Tamil X. In Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, pages 387–391, Underline (Virtual). Association for Computational Linguistics.
- Cite (Informal):
- TamilEcho_Political@DravidianLangTech 2026: Hybrid XLM-RoBERTa with Sarcasm-Aware Feature Fusion for Political Multiclass Sentiment Analysis in Tamil X (S et al., DravidianLangTech 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.dravidianlangtech-1.61.pdf