Learning Composable Diffusion Guidance for Motion Priors
Abstract
Diffusion models have emerged as a promising choice for learning robot skills from demonstrations. However, they face three problems: diffusion models are not sample-efficient, data is expensive to collect in robotics, and the space of tasks is combinatorially large. The established method to train diffusion models on skill demonstrations borrow from the literature on image generation, and results in a conditional distribution of robot actions given the visual, proprioceptive and other observations. However, they have little room to accommodate solutions for the aforementioned challenges, in addition to scaling the model size and paired observation-action data. In this work, we propose a novel method for training diffusion models termed ‘Composable Diffusion Guidance’ CoDiG to compositionally learn diffusion policies for robot skills. CoDiG decouples the observation modalities allowing the residual learning of one modality with respect to the others. While presenting a more intuitive modeling paradigm, CoDiG also enables the scaling of modalities such as robot motions independently. Our preliminary results show that visual CoDiG with motion-priors outperforms the conventional way of learning visuomotor policies using diffusion models on skills with relatively low-diversity of robot motion. Further experimentation is needed to evaluate the performance and robustness of CoDiG for different observation modalities, and on different classes of skills, such as long-horizon and precise manipulation.
Cite
Text
Patil et al. "Learning Composable Diffusion Guidance for Motion Priors." ICLR 2025 Workshops: WRL, 2025.Markdown
[Patil et al. "Learning Composable Diffusion Guidance for Motion Priors." ICLR 2025 Workshops: WRL, 2025.](https://mlanthology.org/iclrw/2025/patil2025iclrw-learning/)BibTeX
@inproceedings{patil2025iclrw-learning,
title = {{Learning Composable Diffusion Guidance for Motion Priors}},
author = {Patil, Omkar and Rosen, Eric and Gopalan, Nakul},
booktitle = {ICLR 2025 Workshops: WRL},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/patil2025iclrw-learning/}
}