Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Abstract
Deep Generative Models (DGMs), such as Diffusion Models, have achieved promising performance in approximating complex data distributions. However, it is rare to see their application to distributional Reinforcement Learning (RL), which remains dominated by the classical histogram-based methods that inevitably incur discretization errors. In this paper, we highlight that this gap stems from the non-linearity of modern DGMs, which conflicts with the linear structure of the Bellman equation, a key technique to permit efficiently training RL models. To address this, we introduce \emph{Bellman Diffusion}, a new DGM that preserves the necessary linearity by modeling both the gradient and scalar fields. We propose a novel divergence-based training technique to optimize neural network proxies and introduce a new stochastic differential equation for sampling. With these innovations, Bellman Diffusion is guaranteed to converge to the target distribution. Our experiments show that Bellman Diffusion not only achieves accurate field estimations and serves as an effective image generator, but also converges $1.5\times$ faster than traditional histogram-based baselines in distributional RL tasks. This work paves the way for the effective integration of DGMs into MDP applications, enabling more advanced decision-making frameworks.
Cite
Text
Li et al. "Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space." ICLR 2025 Workshops: FPI, 2025.Markdown
[Li et al. "Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space." ICLR 2025 Workshops: FPI, 2025.](https://mlanthology.org/iclrw/2025/li2025iclrw-bellman/)BibTeX
@inproceedings{li2025iclrw-bellman,
title = {{Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space}},
author = {Li, Yangming and Lai, Chieh-Hsin and Schönlieb, Carola-Bibiane and Mitsufuji, Yuki and Ermon, Stefano},
booktitle = {ICLR 2025 Workshops: FPI},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/li2025iclrw-bellman/}
}