GauDP: Reinventing Multi-Agent Collaboration Through Gaussian-Image Synergy in Diffusion Policies

Abstract

Despite significant advances in robotic policy generation, effective coordination in embodied multi-agent systems remains a fundamental challenge, particularly in scenarios where agents must reconcile individual perspectives with global environmental awareness. Existing approaches often struggle to combine fine-grained local control with comprehensive scene understanding, resulting in limited scalability and compromised collaboration quality. In this paper, we present GauDP, a novel Gaussian-image synergistic representation that facilitates scalable, perception-aware imitation learning in multi-agent collaborative systems. Specifically, GauDP reconstructs a globally consistent 3D Gaussian field from local-view RGB images, allowing all agents to dynamically query task-relevant features from a shared scene representation. This design supports both fine-grained control and globally coherent behavior without requiring additional sensing modalities. We evaluate GauDP on the RoboFactory benchmark, which includes diverse multi-arm manipulation tasks. Our method outperforms existing image-based methods and approaches the effectiveness of point-cloud-driven methods, while maintaining strong scalability as the number of agents increases. Extensive ablations and visualizations further demonstrate the robustness and efficiency of our unified local-global perception framework for multi-agent embodied learning.

Cite

Text

Wang et al. "GauDP: Reinventing Multi-Agent Collaboration Through Gaussian-Image Synergy in Diffusion Policies." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wang et al. "GauDP: Reinventing Multi-Agent Collaboration Through Gaussian-Image Synergy in Diffusion Policies." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wang2025neurips-gaudp/)

BibTeX

@inproceedings{wang2025neurips-gaudp,
  title     = {{GauDP: Reinventing Multi-Agent Collaboration Through Gaussian-Image Synergy in Diffusion Policies}},
  author    = {Wang, Ziye and Kang, Li and Qin, Yiran and Ma, Jiahua and Peng, Zhanglin and Bai, Lei and Zhang, Ruimao},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wang2025neurips-gaudp/}
}