Web Sitemap Knowledge Can Enhance Autonomous Browsing

Yuyao Zhang, Hongyu Lu, Jiajie Jin, Hongjin Qian, Shiyu Li, Zhao Yang, Yutao Zhu, Ji-Rong Wen, Zhicheng Dou


Abstract
Recent advances in large language models (LLMs) have enabled web agents to perform interactive tasks on real-world websites. However, existing agents still suffer from limited robustness, efficiency, and task success, largely due to their lack of structural understanding of websites and the absence of browsing priors in pre-trained models. To address these challenges, this paper proposes the Web Agent Sitemap Protocol (WASP), an agent-oriented sitemap that integrate structured website knowledge into web agents. WASP adopts a dual-granularity design, providing global site-level structure and local page-level semantic and interaction guidance. We also introduce a framework LightASM for constructing such sitemaps by identifying core pages and generating concise semantic summaries and block-level descriptions. Experiments on real-world browsing benchmarks demonstrate that WASP substantially improves the robustness, efficiency, and effectiveness of LLM-based web agents without extra training.
Anthology ID:
2026.findings-acl.1465
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
29307–29321
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1465/
DOI:
Bibkey:
Cite (ACL):
Yuyao Zhang, Hongyu Lu, Jiajie Jin, Hongjin Qian, Shiyu Li, Zhao Yang, Yutao Zhu, Ji-Rong Wen, and Zhicheng Dou. 2026. Web Sitemap Knowledge Can Enhance Autonomous Browsing. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29307–29321, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Web Sitemap Knowledge Can Enhance Autonomous Browsing (Zhang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1465.pdf
Checklist:
 2026.findings-acl.1465.checklist.pdf