@inproceedings{you-etal-2025-pali,
title = "{PALI}-{NLP} at {S}em{E}val 2025 Task 1: Multimodal Idiom Recognition and Alignment",
author = "You, Runyang and
Mei, Xinyue and
Zhou, Mengyuan",
editor = "Rosenthal, Sara and
Ros{\'a}, Aiala and
Ghosh, Debanjan and
Zampieri, Marcos",
booktitle = "Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.161/",
pages = "1211--1216",
ISBN = "979-8-89176-273-2",
abstract = "Understanding idioms in multimodal contexts poses significant challenges due to data scarcity, idiomatic ambiguity, and the need for effective alignment of visual and textual inputs. In this work, we introduce MIRA (Multimodal Idiom Recognition and Alignment), a training-free framework designed to address these challenges on the SemEval-2025 Task 1 (AdMIRe) benchmark. MIRA leverages powerful closed-source large language models (LLMs) and integrates three key innovations: bias correction via in-context learning, multi-step semantic-visual fusion, and a self-revision mechanism that iteratively refines its outputs through backward verification. By systematically processing and fusing multimodal inputs, MIRA generates high-quality, fine-grained image-text representations that enhance idiom comprehension across different languages and cultural contexts. Experimental evaluations in both English and Portuguese demonstrate that our approach achieves robust performance without the need for additional training, setting a new standard for multimodal idiom recognition."
}
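The abstract describes a three-stage, training-free pipeline: bias correction via in-context examples, multi-step semantic-visual fusion, and self-revision through backward verification. The sketch below is only a generic illustration of such a rank-and-revise loop under assumed prompts and interfaces, not the paper's implementation; `query_llm`, `parse_ranking`, the prompt wording, and the ranking format are hypothetical placeholders.

```python
# Hypothetical sketch of a training-free rank-and-revise loop in the spirit of
# the abstract's description; query_llm is a placeholder, not the paper's code.
from typing import Callable, List

def rank_images(idiom: str,
                sentence: str,
                captions: List[str],
                query_llm: Callable[[str], str],
                max_revisions: int = 2) -> List[int]:
    """Return a ranking (indices into `captions`) of candidate images by how
    well they depict the idiom's intended (literal or figurative) sense."""
    # Step 1: in-context examples intended to counter a literal-reading bias.
    few_shot = (
        "Example: idiom 'spill the beans' used figuratively -> prefer images "
        "about revealing a secret, not images of actual beans.\n"
    )
    # Step 2: fuse the textual context with per-image descriptions in one prompt.
    prompt = (
        few_shot
        + f"Idiom: {idiom}\nSentence: {sentence}\n"
        + "\n".join(f"Image {i}: {c}" for i, c in enumerate(captions))
        + "\nRank the images (best first) as comma-separated indices."
    )
    ranking = parse_ranking(query_llm(prompt), len(captions))

    # Step 3: backward verification -- ask the model to check its own ranking
    # against the idiom's intended sense and revise it if it does not hold up.
    for _ in range(max_revisions):
        check = (
            f"Given idiom '{idiom}' in '{sentence}', is the ranking {ranking} "
            "consistent with the intended sense of the idiom? "
            "Answer 'yes' or give a corrected comma-separated ranking."
        )
        reply = query_llm(check)
        if reply.strip().lower().startswith("yes"):
            break
        ranking = parse_ranking(reply, len(captions))
    return ranking

def parse_ranking(text: str, n: int) -> List[int]:
    """Extract the first n distinct indices from a reply; fall back to identity order."""
    seen: List[int] = []
    for tok in text.replace(",", " ").split():
        if tok.isdigit() and int(tok) < n and int(tok) not in seen:
            seen.append(int(tok))
    return seen + [i for i in range(n) if i not in seen]
```

With `query_llm` bound to any chat-completion wrapper, the loop terminates after at most `max_revisions` verification rounds and always returns a full permutation of the candidate images.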