Delta Denoising Score

Abstract

This paper introduces Delta Denoising Score (DDS), a novel diffusion-based scoring technique that optimizes a parametric model for the task of image editing. Unlike the existing Score Distillation Sampling (SDS), which queries the generative model with a single image-text pair, DDS utilizes an additional fixed query of a reference image-text pair to generate delta scores that represent the difference between the outputs of the two queries. By estimating noisy gradient directions introduced by SDS using the source image and its text description, DDS provides cleaner gradient directions that modify the edited portions of the image while leaving others unchanged, yielding a distilled edit of the source image. The analysis presented in this paper supports the power of the new score for image-to-image translation. We further show that the new score can be used to train an effective zero-shot image translation model. The experimental results show that the proposed loss term outperforms existing methods in terms of stability and quality, highlighting its potential for real-world applications.

Cite

Text

Hertz et al. "Delta Denoising Score." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00221

Markdown

[Hertz et al. "Delta Denoising Score." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/hertz2023iccv-delta/) doi:10.1109/ICCV51070.2023.00221

BibTeX

@inproceedings{hertz2023iccv-delta,
  title     = {{Delta Denoising Score}},
  author    = {Hertz, Amir and Aberman, Kfir and Cohen-Or, Daniel},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {2328-2337},
  doi       = {10.1109/ICCV51070.2023.00221},
  url       = {https://mlanthology.org/iccv/2023/hertz2023iccv-delta/}
}