Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Abstract

Diffusion models have recently been investigated as powerful generative solvers for image dehazing, owing to their remarkable capability to model the data distribution. However, the massive computational burden imposed by the retraining of diffusion models, coupled with the extensive sampling steps during the inference, limit the broader application of diffusion models in image dehazing. To address these issues, we explore the properties of hazy images in the semantic latent space of frozen pre-trained diffusion models, and propose a Diffusion Latent Inspired network for Image Dehazing, dubbed DiffLI2 D. Specifically, we first reveal that the semantic latent space of pre-trained diffusion models can represent the content and haze characteristics of hazy images, as the diffusion time-step changes. Building upon this insight, we integrate the diffusion latent representations at different time-steps into a delicately designed dehazing network to provide instructions for image dehazing. Our DiffLI2 D avoids re-training diffusion models and iterative sampling process by effectively utilizing the informative representations derived from the pre-trained diffusion models, which also offers a novel perspective for introducing diffusion models to image dehazing. Extensive experiments on multiple datasets demonstrate that the proposed method achieves superior performance to existing image dehazing methods.

Cite

Text

Yang et al. "Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72784-9_21

Markdown

[Yang et al. "Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/yang2024eccv-unleashing/) doi:10.1007/978-3-031-72784-9_21

BibTeX

@inproceedings{yang2024eccv-unleashing,
  title     = {{Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing}},
  author    = {Yang, Zizheng and Yu, Hu and Li, Bing and Zhang, Jinghao and Huang, Jie and Zhao, Feng},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72784-9_21},
  url       = {https://mlanthology.org/eccv/2024/yang2024eccv-unleashing/}
}