State Combinatorial Generalization in Decision Making with Conditional Diffusion Models

Duan, Xintong; He, Yutong; Tajwar, Fahim; Chen, Wentse; Salakhutdinov, Ruslan; Schneider, Jeff

State Combinatorial Generalization in Decision Making with Conditional Diffusion Models

Xintong Duan, Yutong He, Fahim Tajwar, Wentse Chen, Ruslan Salakhutdinov, Jeff Schneider

TMLR 2025

/tmlr/2025/duan2025tmlr-state/

Abstract

Many real-world decision-making problems are combinatorial in nature, where states (e.g., surrounding traffic of a self-driving car) can be seen as a combination of basic elements (e.g., pedestrians, trees, and other cars). Due to combinatorial complexity, observing all combinations of basic elements in the training set is infeasible, which leads to an essential yet understudied problem of zero-shot generalization to states that are unseen combinations of previously seen elements. In this work, we first formalize this problem and then demonstrate how existing value-based reinforcement learning (RL) algorithms struggle due to unreliable value predictions in unseen states. We argue that this problem cannot be addressed with exploration alone, but requires more expressive and generalizable models. We demonstrate that behavior cloning with a conditioned diffusion model trained on successful trajectory generalizes better to states formed by new combinations of seen elements than traditional RL methods. Through experiments in maze, driving, and multiagent environments, we show that conditioned diffusion models outperform traditional RL techniques and highlight the broad applicability of our problem formulation.

PDF TMLR Semantic Scholar

Cite

Text

Duan et al. "State Combinatorial Generalization in Decision Making with Conditional Diffusion Models." Transactions on Machine Learning Research, 2025.

Markdown

[Duan et al. "State Combinatorial Generalization in Decision Making with Conditional Diffusion Models." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/duan2025tmlr-state/)

BibTeX

@article{duan2025tmlr-state,
  title     = {{State Combinatorial Generalization in Decision Making with Conditional Diffusion Models}},
  author    = {Duan, Xintong and He, Yutong and Tajwar, Fahim and Chen, Wentse and Salakhutdinov, Ruslan and Schneider, Jeff},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/duan2025tmlr-state/}
}