WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model

Tianqing Fang; Hongming Zhang; Zhisong Zhang; Kaixin Ma; Wenhao Yu; Haitao Mi; Dong Yu (于东)

WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model

Tianqing Fang, Hongming Zhang, Zhisong Zhang, Kaixin Ma, Wenhao Yu, Haitao Mi, Dong Yu

Abstract

Agent self-improvement, where agents autonomously train their underlying Large Language Model (LLM) on self-sampled trajectories, shows promising results but often stagnates in web environments due to limited exploration and under-utilization of pretrained web knowledge. To improve the performance of self-improvement, we propose a novel framework that introduces a co-evolving World Model LLM. This world model predicts the next observation based on the current observation and action within the web environment. The World Model serves dual roles: (1) as a virtual web server generating self-instructed training data to continuously refine the agent’s policy, and (2) as an imagination engine during inference, enabling look-ahead simulation to guide action selection for the agent LLM. Experiments in real-world web environments (Mind2Web-Live, WebVoyager, and GAIA-web) show a 10% performance gain over existing self-evolving agents, demonstrating the efficacy and generalizability of our approach, without using any distillation from more powerful close-sourced models.

Anthology ID:: 2025.emnlp-main.454
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8970–8986
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.454/
DOI:
Bibkey:
Cite (ACL):: Tianqing Fang, Hongming Zhang, Zhisong Zhang, Kaixin Ma, Wenhao Yu, Haitao Mi, and Dong Yu. 2025. WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 8970–8986, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model (Fang et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.454.pdf
Checklist:: 2025.emnlp-main.454.checklist.pdf

PDF Cite Search Checklist Fix data