Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees
Abstract
Discrete diffusion models have recently gained significant prominence in applications involving natural language and graph data. A key factor influencing their effectiveness is the efficiency of discretized samplers. Among these, $\tau$-leaping samplers have become particularly popular due to their theoretical and empirical success. However, existing theoretical analyses of $\tau$-leaping often rely on somewhat restrictive and difficult-to-verify regularity assumptions, and their convergence bounds contain quadratic dependence on the vocabulary size. In this work, we introduce a new analytical approach for discrete diffusion models that removes the need for such assumptions. For the standard $\tau$-leaping method, we establish convergence guarantees in KL divergence that scale linearly with vocabulary size, improving upon prior results with quadratic dependence. Our approach is also more broadly applicable: it provides the first convergence guarantees for other widely used samplers, including the Euler method and Tweedie $\tau$-leaping. Central to our approach is a novel technique based on differential inequalities, offering a more flexible alternative to the traditional Girsanov change-of-measure methods. This technique may also be of independent interest for the analysis of other stochastic processes.
Cite
Text
Liang et al. "Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees." Advances in Neural Information Processing Systems, 2025.Markdown
[Liang et al. "Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/liang2025neurips-discrete/)BibTeX
@inproceedings{liang2025neurips-discrete,
title = {{Discrete Diffusion Models: Novel Analysis and New Sampler Guarantees}},
author = {Liang, Yuchen and Liang, Yingbin and Lai, Lifeng and Shroff, Ness},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/liang2025neurips-discrete/}
}