Scaling Agents via Continual Pre-Training
Abstract
Large language models (LLMs) have evolved into agentic systems capable of autonomous tool use and multi-step reasoning for complex problem-solving. However, post-training approaches building upon general-purpose foundation models consistently underperform in agentic tasks, particularly in open-source implementations. We identify the root cause: the absence of robust agentic foundation models forces models during post-training to simultaneously learn diverse agentic behaviors while aligning them to expert demonstrations, thereby creating fundamental optimization tensions. To this end, we are the first to propose incorporating Agentic Continual Pre-training (Agentic CPT) into the deep research agents training pipeline to build powerful agentic foundational models. Based on this approach, we develop a deep research agent model named AgentFounder. We evaluate our AgentFounder-30B on 10 benchmarks and achieve state-of-the-art performance while retains strong tool-use ability, notably 39.9% on BrowseComp-en, 43.3% on BrowseComp-zh, and 31.5% Pass@1 on HLE.
Cite
Text
Su et al. "Scaling Agents via Continual Pre-Training." International Conference on Learning Representations, 2026.Markdown
[Su et al. "Scaling Agents via Continual Pre-Training." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/su2026iclr-scaling/)BibTeX
@inproceedings{su2026iclr-scaling,
title = {{Scaling Agents via Continual Pre-Training}},
author = {Su, Liangcai and Zhang, Zhen and Li, Guangyu and Chen, Zhuo and Wang, Chenxi and Song, Maojia and Wang, Xinyu and Li, Kuan and Wu, Jialong and Chen, Xuanzhong and Qiao, Zile and Zhang, Zhongwang and Yin, Huifeng and Cai, Shihao and Fang, Runnan and Tao, Zhengwei and Yin, Wenbiao and Ye, Rui and Jiang, Yong and Zhang, Ningyu and Xie, Pengjun and Huang, Fei and Ye, Kai and Tu, Kewei and Qian, Chenxiong and Zhou, Jingren},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/su2026iclr-scaling/}
}