Exact Diffusion Inversion via Bidirectional Integration Approximation

Abstract

Recently, various methods have been proposed to address the inconsistency issue of DDIM inversion to enable image editing, such as EDICT [?] and Null-text inversion [?]. However, the above methods introduce considerable computational overhead. In this paper, we propose a new technique, named bidirectional integration approximation (BDIA), to perform exact diffusion inversion with negligible computational overhead. We consider a family of second order integration algorithms obtained by averaging forward and backward DDIM steps. The resulting approach estimates the next diffusion state as a linear combination of the estimated Gaussian noise at the current step and the previous and current diffusion states. This allows for exact backward computation of previous state given the current and next ones, leading to exact diffusion inversion. We perform a convergence analysis for BDIA-DDIM that includes the analysis for DDIM as a special case. It is demonstrated with experiments that BDIA-DDIM is effective for (round-trip) prompt-driven image editing. Our experiments further show that BDIA-DDIM produces markedly better image sampling quality than DDIM and EDICT for text-to-image generation and conventional image sampling.1 1 BDIA can also be applied to improve the performance of other ODE solvers in addition to DDIM. In particular, it is found that applying BDIA to the EDM sampling procedure produces consistently better performance over four pre-trained models (see Alg. ?? and Table ?? in Appendix ??).

Cite

Text

Zhang et al. "Exact Diffusion Inversion via Bidirectional Integration Approximation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72998-0_2

Markdown

[Zhang et al. "Exact Diffusion Inversion via Bidirectional Integration Approximation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhang2024eccv-exact/) doi:10.1007/978-3-031-72998-0_2

BibTeX

@inproceedings{zhang2024eccv-exact,
  title     = {{Exact Diffusion Inversion via Bidirectional Integration Approximation}},
  author    = {Zhang, Guoqiang and Lewis, J.P. and Kleijn, W. Bastiaan},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72998-0_2},
  url       = {https://mlanthology.org/eccv/2024/zhang2024eccv-exact/}
}