NASIMLab at SemEval-2026 Task 9: A Comparative Analysis of Fine-Tuned Small Language Models vs. Generative Large Language Models for Multilingual Polarization Type Detection

Neel Sabhahit; Sanjeevan Selvaganapathy; Mehwish Nasim

NASIMLab at SemEval-2026 Task 9: A Comparative Analysis of Fine-Tuned Small Language Models vs. Generative Large Language Models for Multilingual Polarization Type Detection

Neel Sabhahit, Sanjeevan Selvaganapathy, Mehwish Nasim

Abstract

The POLAR dataset contains various social media texts that might be polarized (conflict-inducing or dangerously divisive). The task at hand is to identify whether any of the following types of polarization are present: political, racial/ethnic, religious, gender/sexual, and other types across 22 languages. In this paper, we propose a system of fine-tuned language-specific small language models and compare our approach with state-of-the-art large language models on the POLAR dataset. By fine-tuning models for each language, we demonstrate that fine-tuned small encoder-only models consistently outperform large language models, especially for low-resource languages. Our system performs well on this task for most low-resource languages, notably taking the top spot on the leaderboard in Burmese (mya), appearing within the top 10 for 12 languages, and within the top 20 for all remaining languages.

Anthology ID:: 2026.semeval-1.413
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3316–3327
Language:
URL:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.413/
DOI:
Bibkey:
Cite (ACL):: Neel Sabhahit, Sanjeevan Selvaganapathy, and Mehwish Nasim. 2026. NASIMLab at SemEval-2026 Task 9: A Comparative Analysis of Fine-Tuned Small Language Models vs. Generative Large Language Models for Multilingual Polarization Type Detection. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 3316–3327, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: NASIMLab at SemEval-2026 Task 9: A Comparative Analysis of Fine-Tuned Small Language Models vs. Generative Large Language Models for Multilingual Polarization Type Detection (Sabhahit et al., SemEval 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.413.pdf
Supplementarymaterial:: 2026.semeval-1.413.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data