AdDriftBench: A Benchmark for Detecting Data Drift and Label Drift in Short Video Advertising

Yinghao Song; Xiangji Zeng; Shuai Cui; Lu Sun; Zhaowei Liu; Yuan Yuan; Yulu Wang; Hai Zhou; Zhaohan Gong

doi:10.18653/v1/2025.findings-emnlp.545

AdDriftBench: A Benchmark for Detecting Data Drift and Label Drift in Short Video Advertising

Yinghao Song, Xiangji Zeng, Shuai Cui, Lu Sun, Zhaowei Liu, Yuan Yuan, Yulu Wang, Hai Zhou, Zhaohan Gong

Abstract

With the commercialization of short video platforms (SVPs), the demand for compliance auditing of advertising content has grown rapidly. The rise of large vision-language models (VLMs) offers new opportunities for automating ad content moderation. However, short video advertising scenarios present unique challenges due to data drift (DD) and label drift (LD). DD refers to rapid shifts in data distribution caused by advertisers to evade platform review mechanisms. LD arises from the evolving and increasingly standardized review guidelines of SVPs, which effectively alter the classification boundaries over time. Despite the significance of these phenomena, there is currently a lack of benchmark tools designed to evaluate model performance under such conditions. To address this gap, we propose AdDriftBench (ADB). The ADB dataset consists of 3,480 short video ads, including 2,280 examples labeled under data drift scenarios, designed to evaluate the generalization capabilities of VLMs under rapidly shifting content distributions. An additional 1,200 examples represent label drift scenarios, aimed at assessing VLMs’ abilities in instruction following and fine-grained semantic understanding under varying auditing standards. Through extensive experiments on 16 open-source VLMs, we find that current models perform moderately in short video advertising contexts, particularly in handling fine-grained semantics and adapting to shifting instructions. Our dataset will be made publicly available.

Anthology ID:: 2025.findings-emnlp.545
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10305–10321
Language:
URL:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.545/
DOI:: 10.18653/v1/2025.findings-emnlp.545
Bibkey:
Cite (ACL):: Yinghao Song, Xiangji Zeng, Shuai Cui, Lu Sun, Zhaowei Liu, Yuan Yuan, Yulu Wang, Hai Zhou, and Zhaohan Gong. 2025. AdDriftBench: A Benchmark for Detecting Data Drift and Label Drift in Short Video Advertising. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 10305–10321, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: AdDriftBench: A Benchmark for Detecting Data Drift and Label Drift in Short Video Advertising (Song et al., Findings 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.545.pdf
Checklist:: 2025.findings-emnlp.545.checklist.pdf

PDF Cite Search Checklist Fix data