OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models

Hao Zheng; Zirui Pang; Ling Li; Zhijie Deng; Yuhan Pu; Zhaowei Zhu; Xiaobo Xia; Jiaheng Wei

OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models

Hao Zheng, Zirui Pang, Ling Li, Zhijie Deng, Yuhan Pu, Zhaowei Zhu, Xiaobo Xia, Jiaheng Wei

Abstract

Advances in Multimodal Large Language Models (MLLMs) intensify concerns about data safety, making Machine Unlearning (MU), the selective removal of harmful/private information, a critical necessity. However, existing MU benchmarks for MLLMs are limited by a lack of image diversity, coarse-grained unlearning target, and insufficient evaluation scenarios, which fail to capture the complexity of real-world applications. To facilitate the development of MLLMs unlearning and alleviate the aforementioned limitations, we introduce OFFSIDE, a novel benchmark for evaluating misinformation unlearning in MLLMs. This manually curated dataset contains 15.68K records for 80 players, providing a comprehensive framework with four test sets to assess forgetting efficacy, generalization, utility, and robustness. OFFSIDE supports advanced unlearning targets, such as fine-grained unlearning and visual rumor removal. Our extensive evaluation of multiple baselines not only extends key findings from LLM MU to MLLM MU: (1) unlearned rumors can be easily recovered through relearning and (2) all methods are vulnerable to prompt attacks, but also introduces novel insights in the context of MLLM: (1) unimodal methods fail to handle multimodal rumors, (2) unlearning efficacy is primarily driven by catastrophic forgetting statistically, and (3) all methods struggle with visual rumors (rumors embedded in images). These results expose significant vulnerabilities in current approaches, highlighting the need for more robust multimodal unlearning solutions.

Anthology ID:: 2026.findings-acl.613
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12602–12620
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.613/
DOI:
Bibkey:
Cite (ACL):: Hao Zheng, Zirui Pang, Ling Li, Zhijie Deng, Yuhan Pu, Zhaowei Zhu, Xiaobo Xia, and Jiaheng Wei. 2026. OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 12602–12620, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models (Zheng et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.613.pdf
Checklist:: 2026.findings-acl.613.checklist.pdf

PDF Cite Search Checklist Fix data