Diffusion Alignment as Variational Expectation-Maximization

Lee, Jaewoo; Kim, Minsu; Choi, Sanghyeok; Song, Inhyuck; Yun, Sujin; Kang, Hyeongyu; Shin, Woocheol; Yun, Taeyoung; Om, Kiyoung; Park, Jinkyoo

Diffusion Alignment as Variational Expectation-Maximization

Jaewoo Lee, Minsu Kim, Sanghyeok Choi, Inhyuck Song, Sujin Yun, Hyeongyu Kang, Woocheol Shin, Taeyoung Yun, Kiyoung Om, Jinkyoo Park

ICLR 2026

/iclr/2026/lee2026iclr-diffusion/

Abstract

Diffusion alignment aims to optimize diffusion models for the downstream objective. While existing methods based on reinforcement learning or direct backpropagation achieve considerable success in maximizing rewards, they often suffer from reward over-optimization and mode collapse. We introduce Diffusion Alignment as Variational Expectation-Maximization (DAV), a framework that formulates diffusion alignment as an iterative process alternating between two complementary phases: the E-step and the M-step. In the E-step, we employ test-time search to generate diverse and reward-aligned samples. In the M-step, we refine the diffusion model using samples discovered by the E-step. We demonstrate that DAV can optimize reward while preserving diversity for both continuous and discrete tasks: text-to-image synthesis and DNA sequence design. Our code is available at https://github.com/Jaewoopudding/dav.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Lee et al. "Diffusion Alignment as Variational Expectation-Maximization." International Conference on Learning Representations, 2026.

Markdown

[Lee et al. "Diffusion Alignment as Variational Expectation-Maximization." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/lee2026iclr-diffusion/)

BibTeX

@inproceedings{lee2026iclr-diffusion,
  title     = {{Diffusion Alignment as Variational Expectation-Maximization}},
  author    = {Lee, Jaewoo and Kim, Minsu and Choi, Sanghyeok and Song, Inhyuck and Yun, Sujin and Kang, Hyeongyu and Shin, Woocheol and Yun, Taeyoung and Om, Kiyoung and Park, Jinkyoo},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/lee2026iclr-diffusion/}
}