Yesterday’s News: Benchmarking Multi-Dimensional Out-of-Distribution Generalization of Misinformation Detection Models

Ivo Verhoeven, Pushkar Mishra, Ekaterina Shutova


Abstract
This article introduces misinfo-general, a benchmark dataset for evaluating misinformation models’ ability to perform out-of-distribution generalization. Misinformation changes rapidly, much more quickly than moderators can annotate at scale, resulting in a shift between the training and inference data distributions. As a result, misinformation detectors need to be able to perform out-of-distribution generalization, an attribute they currently lack. Our benchmark uses distant labeling to enable simulating covariate shifts in misinformation content. We identify time, event, topic, publisher, political bias, and misinformation type as important axes for generalization, and we evaluate a common class of baseline models on each. Using article metadata, we show how this model fails desiderata, which is not necessarily obvious from classification metrics. Finally, we analyze properties of the data to ensure limited presence of modelling shortcuts. We make the dataset and accompanying code publicly available.1
Anthology ID:
2026.cl-2.6
Volume:
Computational Linguistics, Volume 52, Issue 2 - June 2026
Month:
June
Year:
2026
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
619–675
Language:
URL:
https://preview.aclanthology.org/codex___ingest-cl-2026-issue-2/2026.cl-2.6/
DOI:
10.1162/coli.a.585
Bibkey:
Cite (ACL):
Ivo Verhoeven, Pushkar Mishra, and Ekaterina Shutova. 2026. Yesterday’s News: Benchmarking Multi-Dimensional Out-of-Distribution Generalization of Misinformation Detection Models. Computational Linguistics, 52(2):619–675.
Cite (Informal):
Yesterday’s News: Benchmarking Multi-Dimensional Out-of-Distribution Generalization of Misinformation Detection Models (Verhoeven et al., CL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/codex___ingest-cl-2026-issue-2/2026.cl-2.6.pdf