Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Abstract
Language agents based on large language models (LLMs) have demonstrated great promise in automating web-based tasks. Recent work has shown that incorporating advanced planning algorithms, e.g., tree search, is advantageous over reactive planning for web agents. However, unlike simulated sandbox environments, real-world environments such as the web are rife with irreversible actions. This undermines the feasibility of backtracking, a cornerstone of (tree) search. Overly relying on test-time search also hurts efficiency. We advocate model-based planning for web agents that employs a world model to simulate and deliberate over the outcome of each candidate action before committing to one. We systematically explore this paradigm by: (1) Proposing a model-based planning framework, WebDreamer, which employs LLMs to serve as both world models and value functions; (2) Training specialized LLMs as world models with a scalable data synthesis pipeline. Empirical results demonstrate that WebDreamers achieves substantial performance improvements over reactive baselines. It is competitive, while being - times more efficient, with tree search in sandbox environments (VisualWebArena) and also works effectively on real-world websites (Online-Mind2Web and Mind2Web-Live). Furthermore, our trained world model, Dreamer-7B, performs comparable to GPT-4o, highlighting the potential of specialized world models for efficient and effective planning in complex web environments. All code, models, and data are publicly available at https://github.com/OSU-NLP-Group/WebDreamer
Cite
Text
Gu et al. "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents." Transactions on Machine Learning Research, 2025.Markdown
[Gu et al. "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/gu2025tmlr-your/)BibTeX
@article{gu2025tmlr-your,
title = {{Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents}},
author = {Gu, Yu and Zhang, Kai and Ning, Yuting and Zheng, Boyuan and Gou, Boyu and Xue, Tianci and Chang, Cheng and Srivastava, Sanjari and Xie, Yanan and Qi, Peng and Sun, Huan and Su, Yu},
journal = {Transactions on Machine Learning Research},
year = {2025},
url = {https://mlanthology.org/tmlr/2025/gu2025tmlr-your/}
}