Irapuarani at SemEval-2025 Task 10: Evaluating Strategies Combining Small and Large Language Models for Multilingual Narrative Detection

Gabriel Assis; Lívia De Azevedo; Joao De Moraes; Laura Ribeiro; Aline Paes

Irapuarani at SemEval-2025 Task 10: Evaluating Strategies Combining Small and Large Language Models for Multilingual Narrative Detection

Gabriel Assis, Lívia De Azevedo, Joao De Moraes, Laura Ribeiro, Aline Paes

Abstract

This paper presents the Irapuarani team’s participation in SemEval-2025 Task 10, Subtask 2, which focuses on hierarchical multi-label classification of narratives from online news articles. We explored three distinct strategies: (1) a direct classification approach using a multilingual Small Language Model (SLM), disregarding the hierarchical structure; (2) a translation-based strategy where texts from multiple languages were translated into a single language using a Large Language Model (LLM), followed by classification with a monolingual SLM; and (3) a hybrid strategy leveraging an SLM to filter domains and an LLM to assign labels while accounting for the hierarchy. We conducted experiments on datasets in all available languages, namely Bulgarian, English, Hindi, Portuguese and Russian. Our results show that Strategy 2 is the most generalizable across languages, achieving test set rankings of 21st in English, 9th in Portuguese and Russian, 7th in Bulgarian, and 10th in Hindi.

Anthology ID:: 2025.semeval-1.7
Volume:: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 38–48
Language:
URL:: https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.7/
DOI:
Bibkey:
Cite (ACL):: Gabriel Assis, Lívia De Azevedo, Joao De Moraes, Laura Ribeiro, and Aline Paes. 2025. Irapuarani at SemEval-2025 Task 10: Evaluating Strategies Combining Small and Large Language Models for Multilingual Narrative Detection. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 38–48, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Irapuarani at SemEval-2025 Task 10: Evaluating Strategies Combining Small and Large Language Models for Multilingual Narrative Detection (Assis et al., SemEval 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.7.pdf

PDF Cite Search Fix data