Declan Booth

2026

ILab-NLP at SemEval-2026 Task 9: Comparing XLM-RoBERTa and LLaMA-2 for Multilingual Polarization Detection
Declan Booth | Gavin Abercrombie | Simona Frenda
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)

This submission describes a system for SemEval-2026 Task 9, Subtask 1, focused on binary detection of polarized versus non-polarized posts in English and Spanish. We compare two approaches: a fine-tuned multilingual encoder model (XLM-RoBERTa) and a prompted generative model (LLaMA-2 7B). Our experiments show that XLM-RoBERTa delivers stronger and more stable performance overall, while LLaMA-2 is more prone to false positives in Spanish due to a strong bias toward predicting the polarized class. In addition to headline results, we analyse model behaviour using confidence signals and SHAP, and report efficiency measurements with CodeCarbon to highlight practical tradeoffs between performance and computational cost.

Co-authors

Venues

SemEval1
WS1

Fix author