Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design

Ou, Zijing; Pani, Chinmay; Li, Yingzhen

Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design

ICLR 2026

/iclr/2026/ou2026iclr-inferencetime/

Abstract

Discrete diffusion models have become highly effective across various domains. However, real-world applications often require the generative process to adhere to certain constraints. To this end, we propose a Sequential Monte Carlo (SMC) framework that enables scalable inference-time control of discrete diffusion models through principled importance weighting and optimal proposal construction. Specifically, our approach derives tractable importance weights for a range of intermediate targets and characterises the optimal proposal, for which we develop two practical approximations: a first-order gradient-based approximation and an amortised proposal trained to minimise the log-variance of the importance weights. Empirical results across synthetic tasks, language modelling, biology design, and text-to-image generation demonstrate that our framework enhances controllability and sample quality, highlighting the effectiveness of SMC as a versatile recipe for scaling discrete diffusion models at inference time.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Ou et al. "Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design." International Conference on Learning Representations, 2026.

Markdown

[Ou et al. "Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/ou2026iclr-inferencetime/)

BibTeX

@inproceedings{ou2026iclr-inferencetime,
  title     = {{Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design}},
  author    = {Ou, Zijing and Pani, Chinmay and Li, Yingzhen},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/ou2026iclr-inferencetime/}
}