Residual Learning in Diffusion Models

Junyu Zhang, Daochang Liu, Eunbyung Park, Shichao Zhang, Chang Xu

CVPR 2024 pp. 7289-7299

doi:10.1109/CVPR52733.2024.00696 /cvpr/2024/zhang2024cvpr-residual/

Abstract

Diffusion models (DMs) have achieved remarkable generative performance particularly with the introduction of stochastic differential equations (SDEs). Nevertheless a gap emerges in the model sampling trajectory constructed by reverse-SDE due to the accumulation of score estimation and discretization errors. This gap results in a residual in the generated images adversely impacting the image quality. To remedy this we propose a novel residual learning framework built upon a correction function. The optimized function enables to improve image quality via rectifying the sampling trajectory effectively. Importantly our framework exhibits transferable residual correction ability i.e. a correction function optimized for one pre-trained DM can also enhance the sampling trajectory constructed by other different DMs on the same dataset. Experimental results on four widely-used datasets demonstrate the effectiveness and transferable capability of our framework.

PDF CVPR Semantic Scholar

Cite

Text

Zhang et al. "Residual Learning in Diffusion Models." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00696

Markdown

[Zhang et al. "Residual Learning in Diffusion Models." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/zhang2024cvpr-residual/) doi:10.1109/CVPR52733.2024.00696

BibTeX

@inproceedings{zhang2024cvpr-residual,
  title     = {{Residual Learning in Diffusion Models}},
  author    = {Zhang, Junyu and Liu, Daochang and Park, Eunbyung and Zhang, Shichao and Xu, Chang},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {7289-7299},
  doi       = {10.1109/CVPR52733.2024.00696},
  url       = {https://mlanthology.org/cvpr/2024/zhang2024cvpr-residual/}
}