Multi-Agent Collaboration via Evolving Orchestration
Abstract
Large language models (LLMs) have achieved remarkable results across diverse downstream tasks, but their monolithic nature restricts scalability and efficiency in complex problem-solving. While recent research explores multi-agent collaboration among LLMs, most approaches rely on static organizational structures that struggle to adapt as task complexity and agent numbers grow, resulting in coordination overhead and inefficiencies. To this end, we propose a puppeteer-style paradigm for LLM-based multi-agent collaboration, where a centralized orchestrator ("puppeteer") dynamically directs agents ("puppets") in response to evolving task states. This orchestrator is trained via reinforcement learning to adaptively sequence and prioritize agents, enabling flexible and evolvable collective reasoning. Experiments on closed- and open-domain scenarios show that this method achieves superior performance with reduced computational costs. Analyses further reveal that the key improvements consistently stem from the emergence of more compact, cyclic reasoning structures under the orchestrator’s evolution. Our code is available at https://github.com/OpenBMB/ChatDev/tree/puppeteer.
Cite
Text
Dang et al. "Multi-Agent Collaboration via Evolving Orchestration." Advances in Neural Information Processing Systems, 2025.Markdown
[Dang et al. "Multi-Agent Collaboration via Evolving Orchestration." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/dang2025neurips-multiagent/)BibTeX
@inproceedings{dang2025neurips-multiagent,
title = {{Multi-Agent Collaboration via Evolving Orchestration}},
author = {Dang, Yufan and Qian, Chen and Luo, Xueheng and Fan, Jingru and Xie, Zihao and Shi, Ruijie and Chen, Weize and Yang, Cheng and Che, Xiaoyin and Tian, Ye and Xiong, Xuantang and Han, Lei and Liu, Zhiyuan and Sun, Maosong},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/dang2025neurips-multiagent/}
}