Terrence Li
2025
MemeQA: Holistic Evaluation for Meme Understanding
Khoi P. N. Nguyen
|
Terrence Li
|
Derek Lou Zhou
|
Gabriel Xiong
|
Pranav Balu
|
Nandhan Alahari
|
Alan Huang
|
Tanush Chauhan
|
Harshavardhan Bala
|
Emre Guzelordu
|
Affan Kashfi
|
Aaron Xu
|
Suyesh Shrestha
|
Megan Vu
|
Jerry Wang
|
Vincent Ng
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Automated meme understanding requires systems to demonstrate fine-grained visual recognition, commonsense reasoning, and extensive cultural knowledge. However, existing benchmarks for meme understanding only concern narrow aspects of meme semantics. To fill this gap, we present MemeQA, a dataset of over 9,000 multiple-choice questions designed to holistically evaluate meme comprehension across seven cognitive aspects. Experiments show that state-of-the-art Large Multimodal Models perform much worse than humans on MemeQA. While fine-tuning improves their performance, they still make many errors on memes wherein proper understanding requires going beyond surface-level sentiment. Moreover, injecting “None of the above” into the available options makes the questions more challenging for the models. Our dataset is publicly available at https://github.com/npnkhoi/memeqa.
2024
MemeIntent: Benchmarking Intent Description Generation for Memes
Jeongsik Park
|
Khoi P. N. Nguyen
|
Terrence Li
|
Suyesh Shrestha
|
Megan Kim Vu
|
Jerry Yining Wang
|
Vincent Ng
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue
While recent years have seen a surge of interest in the automatic processing of memes, much of the work in this area has focused on determining whether a meme contains malicious content. This paper proposes the new task of intent description generation: generating a description of the author’s intentions when creating the meme. To stimulate future work on this task, we (1) annotated a corpus of memes with the intents being perceived by the reader as well as the background knowledge needed to infer the intents and (2) established baseline performance on the intent description generation task using state-of-the-art large language models. Our results suggest the importance of background knowledge retrieval in intent description generation for memes.