SubmissionNumber#=%=#21
FinalPaperTitle#=%=#nowhash at SemEval-2024 Task 4: Exploiting Fusion of Transformers for Detecting Persuasion Techniques in Multilingual Memes
ShortPaperTitle#=%=#
NumberOfPages#=%=#6
CopyrightSigned#=%=#Abu Nowhash Chowdhury
JobTitle#==#
Organization#==#
Abstract#==#Nowadays, memes are considered one of the most prominent forms of medium to disseminate information on social media. Memes are typically constructed in multilingual settings using visuals with texts. Sometimes people use memes to influence mass audiences through rhetorical and psychological techniques, such as causal oversimplification, name-calling, and smear. It is a challenging task to identify those techniques considering memes' multimodal characteristics. To address these challenges, SemEval-2024 Task 4 introduced a shared task focusing on detecting persuasion techniques in multilingual memes. This paper presents our participation in subtasks 1 and 2(b). We use a finetuned language-agnostic BERT sentence embedding (LaBSE) model to extract effective contextual features from meme text to address the challenge of identifying persuasion techniques in subtask 1. For subtask 2(b), We finetune the vision transformer and XLM-RoBERTa to extract effective contextual information from meme image and text data. Finally, we unify those features and employ a single feed-forward linear layer on top to obtain the prediction label. Experimental results on the SemEval 2024 Task 4 benchmark dataset manifested the potency of our proposed methods for subtasks 1 and 2(b).
Author{1}{Firstname}#=%=#Abu Nowhash
Author{1}{Lastname}#=%=#Chowdhury
Author{1}{Username}#=%=#nowhash
Author{1}{Email}#=%=#abunowhashchy@gmail.com
Author{1}{Affiliation}#=%=#Asian University for Women
Author{2}{Firstname}#=%=#Michal
Author{2}{Lastname}#=%=#Ptaszynski
Author{2}{Email}#=%=#michal@mail.kitami-it.ac.jp
Author{2}{Affiliation}#=%=#Kitami Institute of Technology

==========
èéáğö