Spatial Representation of Large Language Models in 2D Scene

WenyaWu WenyaWu, Weihong Deng


Abstract
Spatial representations are fundamental to human cognition, as understanding spatial relationships between objects is essential in daily life. Language serves as an indispensable tool for communicating spatial information, creating a close connection between spatial representations and spatial language. Large language models (LLMs), theoretically, possess spatial cognition due to their proficiency in natural language processing. This study examines the spatial representations of LLMs by employing traditional spatial tasks used in human experiments and comparing the models’ performance to that of humans. The results indicate that LLMs resemble humans in selecting spatial prepositions to describe spatial relationships and exhibit a preference for vertically oriented spatial terms. However, the human tendency to better represent locations along specific axes is absent in the performance of LLMs. This finding suggests that, although spatial language is closely linked to spatial representations, the two are not entirely equivalent.
Anthology ID:
2025.gem-1.3
Volume:
Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Month:
July
Year:
2025
Address:
Vienna, Austria and virtual meeting
Editors:
Kaustubh Dhole, Miruna Clinciu
Venues:
GEM | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18–29
Language:
URL:
https://preview.aclanthology.org/corrections-2025-08/2025.gem-1.3/
DOI:
Bibkey:
Cite (ACL):
WenyaWu WenyaWu and Weihong Deng. 2025. Spatial Representation of Large Language Models in 2D Scene. In Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²), pages 18–29, Vienna, Austria and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
Spatial Representation of Large Language Models in 2D Scene (WenyaWu & Deng, GEM 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2025-08/2025.gem-1.3.pdf