VISaGE: Understanding Visual Generics and Exceptions

Stella Frank, Emily Allaway


Abstract
While Vision Language Models (VLMs) learn conceptual representations, in the form of generalized knowledge, during training, they are typically used to analyze individual instances. When evaluation instances are atypical, this paradigm results in tension between two priors in the model. The first is a pragmatic prior that the textual and visual input are both relevant, arising from VLM finetuning on congruent inputs; the second is a semantic prior that the conceptual representation is generally true for instances of the category. In order to understand how VLMs trade off these priors, we introduce a new evaluation dataset, VISaGE, consisting of both typical and exceptional images. In carefully balanced experiments, we show that conceptual understanding degrades when the assumption of congruency underlying the pragmatic prior is violated with incongruent images. This effect is stronger than the effect of the semantic prior when querying about individual instances.
Anthology ID:
2025.emnlp-main.1655
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
32537–32546
URL:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.emnlp-main.1655/
DOI:
10.18653/v1/2025.emnlp-main.1655
Cite (ACL):
Stella Frank and Emily Allaway. 2025. VISaGE: Understanding Visual Generics and Exceptions. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 32537–32546, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
VISaGE: Understanding Visual Generics and Exceptions (Frank & Allaway, EMNLP 2025)
PDF:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.emnlp-main.1655.pdf
Checklist:
 2025.emnlp-main.1655.checklist.pdf