Evaluating the Design Space of Diffusion-Based Generative Models

Abstract

Most existing theoretical investigations of the accuracy of diffusion models, albeit significant, assume the score function has been approximated to a certain accuracy, and then use this a priori bound to control the error of generation. This article instead provides a first quantitative understanding of the whole generation process, i.e., both training and sampling. More precisely, it conducts a non-asymptotic convergence analysis of denoising score matching under gradient descent. In addition, a refined sampling error analysis for variance exploding models is also provided. The combination of these two results yields a full error analysis, which elucidates (again, but this time theoretically) how to design the training and sampling processes for effective generation. For instance, our theory implies a preference toward noise distribution and loss weighting in training that qualitatively agree with the ones used in [Karras et al., 2022]. It also provides perspectives on the choices of time and variance schedules in sampling: when the score is well trained, the design in [Song et al., 2021] is more preferable, but when it is less trained, the design in [Karras et al., 2022] becomes more preferable.

Cite

Text

Wang et al. "Evaluating the Design Space of Diffusion-Based Generative Models." Neural Information Processing Systems, 2024. doi:10.52202/079017-0610

Markdown

[Wang et al. "Evaluating the Design Space of Diffusion-Based Generative Models." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/wang2024neurips-evaluating/) doi:10.52202/079017-0610

BibTeX

@inproceedings{wang2024neurips-evaluating,
  title     = {{Evaluating the Design Space of Diffusion-Based Generative Models}},
  author    = {Wang, Yuqing and He, Ye and Tao, Molei},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0610},
  url       = {https://mlanthology.org/neurips/2024/wang2024neurips-evaluating/}
}