Yanfeng Zhang
2025
TurnBack: A Geospatial Route Cognition Benchmark for Large Language Models through Reverse Route
Hongyi Luo
|
Qing Cheng
|
Daniel Matos
|
Hari Krishna Gadi
|
Yanfeng Zhang
|
Lu Liu
|
Yongliang Wang
|
Niclas Zeller
|
Daniel Cremers
|
Liqiu Meng
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Humans can interpret geospatial information through natural language, while the geospatial cognition capabilities of Large Language Models (LLMs) remain underexplored. Prior research in this domain has been constrained by non-quantifiable metrics, limited evaluation datasets; unclear research hierarchies further compound these limitations. Therefore, we propose a scalable benchmark and conduct a comprehensive evaluation of the geospatial route cognition of LLMs. We create a large-scale evaluation dataset comprised of 36000 routes from 12 metropolises. Then, we introduce PathBuilder, a novel tool for converting natural language instructions into navigation routes, and vice versa, bridging the gap between geospatial information and natural language. Finally, we propose a new evaluation framework and metrics to rigorously assess 9 state-of-the-art (SOTA) LLMs, on the task of route reversal. The benchmark reveals that LLMs exhibit limited ability to reverse routes: most of the reverse routes neither return to the starting point nor are similar to the optimal route. Additionally, LLMs face challenges such as low robustness in route generation and high confidence for their incorrect answers.
Search
Fix author
Co-authors
- Qing Cheng 1
- Daniel Cremers 1
- Hari Krishna Gadi 1
- Lu Liu 1
- Hongyi Luo 1
- show all...