Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies

Ivo Bueno, Anna Bavaresco, João Miguel Cunha, Philipp Wicke


Abstract
Language and Vision-Language Models exhibit impressive language capabilities akin to human reasoning. However, unlike humans who acquire language through embodied, interactive experiences, these models learn from static datasets without real-world interaction. This difference raises questions about how they conceptualize abstract notions and whether their reasoning aligns with human cognition. We investigate spatial conceptualizations of LLMs and VLMs by conducting analogy prompting studies with LLMs, VLMs, and human participants. We assess their ability to generate and interpret analogies for spatial concepts. We quantitatively compare the analogies produced by each group, examining the impact of multimodal inputs and reasoning mechanisms. Our findings indicate that generative models can produce and interpret analogies but differ significantly from human reasoning in their abstraction of spatial concepts - variability influenced by input modality, model size, and prompting methods, with analogy-based prompts not consistently enhancing alignment. Contributions include a methodology for probing generative models through analogies; a comparative analysis of analogical reasoning among models, and humans; and insights into the effect of multimodal inputs on reasoning.
Anthology ID:
2025.analogyangle-1.9
Volume:
Proceedings of the 2nd Workshop on Analogical Abstraction in Cognition, Perception, and Language (Analogy-Angle II)
Month:
August
Year:
2025
Address:
Vienna, Austria
Editors:
Giulia Rambelli, Filip Ilievski, Marianna Bolognesi, Pia Sommerauer
Venues:
Analogy-Angle | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
108–132
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.analogyangle-1.9/
DOI:
Bibkey:
Cite (ACL):
Ivo Bueno, Anna Bavaresco, João Miguel Cunha, and Philipp Wicke. 2025. Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies. In Proceedings of the 2nd Workshop on Analogical Abstraction in Cognition, Perception, and Language (Analogy-Angle II), pages 108–132, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Testing Spatial Intuitions of Humans and Large Language and Multimodal Models in Analogies (Bueno et al., Analogy-Angle 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.analogyangle-1.9.pdf