Web Sitemap Knowledge Can Enhance Autonomous Browsing
Yuyao Zhang, Hongyu Lu, Jiajie Jin, Hongjin Qian, Shiyu Li, Zhao Yang, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou
Abstract
Recent advances in large language models (LLMs) have enabled web agents to perform interactive tasks on real-world websites. However, existing agents still suffer from limited robustness, efficiency, and task success, largely due to their lack of structural understanding of websites and the absence of browsing priors in pre-trained models. To address these challenges, this paper proposes the Web Agent Sitemap Protocol (WASP), an agent-oriented sitemap that integrate structured website knowledge into web agents. WASP adopts a dual-granularity design, providing global site-level structure and local page-level semantic and interaction guidance. We also introduce a framework LightASM for constructing such sitemaps by identifying core pages and generating concise semantic summaries and block-level descriptions. Experiments on real-world browsing benchmarks demonstrate that WASP substantially improves the robustness, efficiency, and effectiveness of LLM-based web agents without extra training.- Anthology ID:
- 2026.findings-acl.1465
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 29307–29321
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1465/
- DOI:
- Cite (ACL):
- Yuyao Zhang, Hongyu Lu, Jiajie Jin, Hongjin Qian, Shiyu Li, Zhao Yang, Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. 2026. Web Sitemap Knowledge Can Enhance Autonomous Browsing. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29307–29321, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Web Sitemap Knowledge Can Enhance Autonomous Browsing (Zhang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1465.pdf