Jingbo Yang
2026
WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks
Jingbo Yang | Bairu Hou | Wei Wei | Shiyu Chang | Yujia Bao
Findings of the Association for Computational Linguistics: ACL 2026
Jingbo Yang | Bairu Hou | Wei Wei | Shiyu Chang | Yujia Bao
Findings of the Association for Computational Linguistics: ACL 2026
Large-language-model (LLM) agents are becoming competent at straightforward web tasks, such as opening an item page or submitting a form, but still struggle with objectives that require long-horizon navigation, large-scale information extraction, and reasoning under constraints. We present WebDART, a general framework that enables a single LLM to handle such complex chores. WebDART (i) dynamically decomposes each objective into three focused subtasks—navigation, information extraction, and execution—so the model concentrates on one skill at a time, and (ii) continuously re-plans the decomposition as new webpages are revealed, taking advantage of newly discovered filters or shortcuts and avoiding redundant exploration. Evaluated on WebChoreArena, WebDART lifts end-to-end success rates by up to 13.7 percentage points over previous state-of-the-art agents, while matching their performance on the easier WebArena suite and completing tasks with up to 14.7 fewer navigation steps. Code will be publicly available.