DiffSQL: Leveraging Diffusion Model for Zero-Shot Self-Supervised Monocular Depth Estimation

Abstract

Self-supervised monocular depth estimation has attracted significant attention due to its broad applications in autonomous driving and robotics. Although significant performance improvements have been achieved by learning the relative distance of objects with the introduction of Self Query Layer (SQL), it struggles with zero-shot generalization due to the lack of geometric features and the fixed number of query sizes. To address these problems, we propose a diffusion-augmented self-supervised depth estimation framework, named DiffSQL, to learn geometric priors for feature augmentation. Additionally, we introduce a dynamic self-query layer that implicitly computes the relative distances between objects by adjusting the query size according to the feature distribution. Experimental results on the KITTI dataset show that DiffSQL outperforms SQLdepth by 1.03% in terms of AbsRel and 2.79% in terms of SqRel. Furthermore, our experiments demonstrate that DiffSQL is superior in zero-shot generalization.

Cite

Text

Zheng et al. "DiffSQL: Leveraging Diffusion Model for Zero-Shot Self-Supervised Monocular Depth Estimation." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/981

Markdown

[Zheng et al. "DiffSQL: Leveraging Diffusion Model for Zero-Shot Self-Supervised Monocular Depth Estimation." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/zheng2025ijcai-diffsql/) doi:10.24963/IJCAI.2025/981

BibTeX

@inproceedings{zheng2025ijcai-diffsql,
  title     = {{DiffSQL: Leveraging Diffusion Model for Zero-Shot Self-Supervised Monocular Depth Estimation}},
  author    = {Zheng, Heyuan and Liang, Yunji and Liu, Lei and Yu, Zhiwen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {8823-8831},
  doi       = {10.24963/IJCAI.2025/981},
  url       = {https://mlanthology.org/ijcai/2025/zheng2025ijcai-diffsql/}
}