UMUTeam at SemEval-2024 Task 4: Multimodal Identification of Persuasive Techniques in Memes through Large Language Models

Ronghao Pan, José Antonio García-díaz, Rafael Valencia-garcía


Abstract
In this manuscript we describe the UMUTeam’s participation in SemEval-2024 Task 4, a shared task to identify different persuasion techniques in memes. The task is divided into three subtasks. One is a multimodal subtask of identifying whether a meme contains persuasion or not. The others are hierarchical multi-label classifications that consider textual content alone or a multimodal setting of text and visual content. This is a multilingual task, and we participated in all three subtasks but we focus only on the English dataset. Our approach is based on a fine-tuning approach with the pre-trained RoBERTa-large model. In addition, for multimodal cases with both textual and visual content, we used the LMM called LlaVa to extract image descriptions and combine them with the meme text. Our system performed well in three subtasks, achieving the tenth best result with an Hierarchical F1 of 64.774%, the fourth best in Subtask 2a with an Hierarchical F1 of 69.003%, and the eighth best in Subtask 2b with a Macro F1 of 78.660%.
Anthology ID:
2024.semeval-1.96
Volume:
Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
655–666
Language:
URL:
https://aclanthology.org/2024.semeval-1.96
DOI:
Bibkey:
Cite (ACL):
Ronghao Pan, José Antonio García-díaz, and Rafael Valencia-garcía. 2024. UMUTeam at SemEval-2024 Task 4: Multimodal Identification of Persuasive Techniques in Memes through Large Language Models. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 655–666, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
UMUTeam at SemEval-2024 Task 4: Multimodal Identification of Persuasive Techniques in Memes through Large Language Models (Pan et al., SemEval 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2024-07/2024.semeval-1.96.pdf
Supplementary material:
 2024.semeval-1.96.SupplementaryMaterial.txt
Supplementary material:
 2024.semeval-1.96.SupplementaryMaterial.zip