Yijie Deng
2025
PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models
He Zhu
|
Junyou Su
|
Minxin Chen
|
Wen Wang
|
Yijie Deng
|
Guanhua Chen
|
Wenjia Zhang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
In the field of urban planning, existing Vision-Language Models (VLMs) frequently fail to effectively analyze planning maps, which are critical for urban planners and educational contexts. Planning maps require specialized understanding of spatial configurations, regulatory requirements, and multi-scale analysis.To address this challenge, we introduce PlanGPT-VL, the first domain-specific VLM tailored for urban planning maps. PlanGPT-VL employs three innovations:(1) PlanAnno-V framework for high-quality VQA data synthesis,(2) Critical Point Thinking (CPT) to reduce hallucinations through structured verification, and(3) PlanBench-V benchmark for systematic evaluation.Evaluation on PlanBench-V shows that PlanGPT-VL outperforms general-purpose VLMs on planning map interpretation tasks, with our 7B model achieving performance comparable to larger 72B models.
Search
Fix author
Co-authors
- Minxin Chen 1
- Guanhua Chen 1
- Junyou Su 1
- Wen Wang (王雯) 1
- Wenjia Zhang 1
- show all...
- He Zhu 1