Overview of CCL25-Eval Task 1: The Fifth Spatial Cognition Evaluation (SpaCE2025)
Yuhang Qin, Liming Xiao, Nan Hu, Sirui Deng, Jingyuan Ma, Hyang Cui, Zihan Zhang, Chi Hsu Tsai, Jinkun Ding, Sumin Kang, Zhifang Sui, Weidong Zhan
Abstract
"The Fifth Spatial Cognition Evaluation (SpaCE2025) presents a benchmark aimed at evaluating the spatial semantic understanding and reasoning capabilities of Large Language Models(LLMs), primarily in Chinese.It consists of five subtasks: (1) Retrieving Spatial Referents(RSR), (2) Detecting Spatial Semantic Anomalies (DSA), (3) Recognizing Synonymous SpatialExpression (RSE), (4) Spatial Position Reasoning (SPR) in Chinese, and (5) SPR in English. The fourth and fifth subtask share the same content and structure, differing only in language, and are designed to assess the cross-linguistic spatial reasoning capability of LLMs. A total of 12 teams submitted their final results, and the best-performing team achieved an accuracy of 0.7931. The results suggest that while LLMs are capable of handling basic spatial semantic understanding tasks such as RSR, their performance on more complex tasks, such as DSA and RSE, still re-quires improvement. Additionally, finetuning methods that effectively activate LLMs’ reasoning ability are essential to improve their performance."- Anthology ID:
- 2025.ccl-2.4
- Volume:
- Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
- Month:
- August
- Year:
- 2025
- Address:
- Jinan, China
- Editors:
- Hongfei Lin, Bin Li, Hongye Tan
- Venue:
- CCL
- SIG:
- Publisher:
- Chinese Information Processing Society of China
- Note:
- Pages:
- 33–46
- Language:
- URL:
- https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.4/
- DOI:
- Cite (ACL):
- Yuhang Qin, Liming Xiao, Nan Hu, Sirui Deng, Jingyuan Ma, Hyang Cui, Zihan Zhang, Chi Hsu Tsai, Jinkun Ding, Sumin Kang, Zhifang Sui, and Weidong Zhan. 2025. Overview of CCL25-Eval Task 1: The Fifth Spatial Cognition Evaluation (SpaCE2025). In Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025), pages 33–46, Jinan, China. Chinese Information Processing Society of China.
- Cite (Informal):
- Overview of CCL25-Eval Task 1: The Fifth Spatial Cognition Evaluation (SpaCE2025) (Qin et al., CCL 2025)
- PDF:
- https://preview.aclanthology.org/ingest-ccl/2025.ccl-2.4.pdf