WebDancer: Towards Autonomous Information Seeking Agency

Abstract

Addressing intricate real-world problems necessitates in-depth information seeking and multi-step reasoning. Recent progress in agentic systems, exemplified by Deep Research, underscores the potential for autonomous multi-step research. In this work, we present a cohesive paradigm for building end-to-end agentic information seeking agents from a data-centric and training-stage perspective. Our approach consists of four key stages: (1) browsing data construction, (2) trajectories sampling, (3) supervised fine-tuning for effective cold start, and (4) reinforcement learning for enhanced generalisation. We instantiate this framework in a web agent based on the ReAct format, WebDancer. Empirical evaluations on the challenging GAIA and WebWalkerQA benchmarks demonstrate the strong performance of WebDancer, achieving considerable results and highlighting the efficacy of our training paradigm. Further analysis of agent training provides valuable insights and actionable, systematic pathways for developing more capable agentic models.

Cite

Text

Wu et al. "WebDancer: Towards Autonomous Information Seeking Agency." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wu et al. "WebDancer: Towards Autonomous Information Seeking Agency." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wu2025neurips-webdancer/)

BibTeX

@inproceedings{wu2025neurips-webdancer,
  title     = {{WebDancer: Towards Autonomous Information Seeking Agency}},
  author    = {Wu, Jialong and Li, Baixuan and Fang, Runnan and Yin, Wenbiao and Zhang, Liwen and Wang, Zhenglin and Tao, Zhengwei and Zhang, Ding-Chu and Xi, Zekun and Tang, Xiangru and Jiang, Yong and Xie, Pengjun and Huang, Fei and Zhou, Jingren},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wu2025neurips-webdancer/}
}