Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models

Chen, Chubin; Zhu, Jiashu; Feng, Xiaokun; Huang, Nisha; Zhu, Chen; Wu, Meiqi; Mao, Fangyuan; Wu, Jiahong; Chu, Xiangxiang; Li, Xiu

Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models

Chubin Chen, Jiashu Zhu, Xiaokun Feng, Nisha Huang, Chen Zhu, Meiqi Wu, Fangyuan Mao, Jiahong Wu, Xiangxiang Chu, Xiu Li

ICLR 2026

/iclr/2026/chen2026iclr-stochastic/

Abstract

Classifier-free Guidance (CFG) is a widely used technique in modern diffusion models for generating high-quality samples. However, through an empirical analysis on both Gaussian mixture models with closed-form solutions and real-world data distributions, we observe a discrepancy between the suboptimal results produced by CFG and the ground truth. The model's excessive reliance on these suboptimal predictions often leads to low fidelity and semantic incoherence. To address this issue, we first empirically demonstrate that the model's suboptimal predictions can be effectively refined using sub-networks of the model itself, without requiring additional training or the integration of external modules. Building on this insight, we propose **$S^2$-Guidance ($S$tochastic $S$elf-Guidance)**, a novel method that leverages stochastic block-dropping during the denoising process to construct sub-networks. This approach effectively guides the model away from potential low-quality predictions, thereby improving sample quality. Extensive qualitative and quantitative experiments across multiple standard benchmarks for text-to-image and text-to-video generation tasks demonstrate that **$S^2$-Guidance** delivers superior performance, consistently surpassing CFG and other advanced guidance strategies. Our code will be released.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Chen et al. "Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models." International Conference on Learning Representations, 2026.

Markdown

[Chen et al. "Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/chen2026iclr-stochastic/)

BibTeX

@inproceedings{chen2026iclr-stochastic,
  title     = {{Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models}},
  author    = {Chen, Chubin and Zhu, Jiashu and Feng, Xiaokun and Huang, Nisha and Zhu, Chen and Wu, Meiqi and Mao, Fangyuan and Wu, Jiahong and Chu, Xiangxiang and Li, Xiu},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/chen2026iclr-stochastic/}
}