BGDNet: Background-Guided Indoor Panorama Depth Estimation

Abstract

Depth estimation from single perspective image has received significant attention in the past decade, whereas the same task applied to single panoramic image remains comparatively under-explored. Most existing depth estimation models for panoramic images imitate models proposed for perspective images, which take RGB images as input and output depth directly. However, as demonstrated by our experiments, model performance drops significantly when the training and testing datasets greatly differ, since they over-fit the training data. To address this issue, we propose a novel method, referred to as the Background-guided Network (BGDNet), for more robust and accurate depth estimation from indoor panoramic images. Different from existing models, our proposed BGDNet first infers the background depth, namely from walls, floor and ceiling, via background masks, room layout and camera model. The background depth is then used to guide and improve the output foreground depth. We perform within dataset as well as cross-domain experiments on two benchmark datasets. The results show that BGDNet outperforms the state-of-the-art baselines, and is more robust to overfitting issues, with superior generalization across datasets.

Cite

Text

Chen et al. "BGDNet: Background-Guided Indoor Panorama Depth Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00134

Markdown

[Chen et al. "BGDNet: Background-Guided Indoor Panorama Depth Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/chen2024cvprw-bgdnet/) doi:10.1109/CVPRW63382.2024.00134

BibTeX

@inproceedings{chen2024cvprw-bgdnet,
  title     = {{BGDNet: Background-Guided Indoor Panorama Depth Estimation}},
  author    = {Chen, Jiajing and Wan, Zhiqiang and Narayana, Manjunath and Li, Yuguang and Hutchcroft, Will and Velipasalar, Senem and Kang, Sing Bing},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {1272-1281},
  doi       = {10.1109/CVPRW63382.2024.00134},
  url       = {https://mlanthology.org/cvprw/2024/chen2024cvprw-bgdnet/}
}