Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Abstract

Multi-view depth estimation methods typically require the computation of a multi-view cost-volume, which leads to huge memory consumption and slow inference. Furthermore, multi-view matching can fail for texture-less surfaces, reflective surfaces and moving objects. For such failure modes, single-view depth estimation methods are often more reliable. To this end, we propose MaGNet, a novel framework for fusing single-view depth probability with multi-view geometry, to improve the accuracy, robustness and efficiency of multi-view depth estimation. For each frame, MaGNet estimates a single-view depth probability distribution, parameterized as a pixel-wise Gaussian. The distribution estimated for the reference frame is then used to sample per-pixel depth candidates. Such probabilistic sampling enables the network to achieve higher accuracy while evaluating fewer depth candidates. We also propose depth consistency weighting for the multi-view matching score, to ensure that the multi-view depth is consistent with the single-view predictions. The proposed method achieves state-of-the-art performance on ScanNet, 7-Scenes and KITTI. Qualitative evaluation demonstrates that our method is more robust against challenging artifacts such as texture-less/reflective surfaces and moving objects. Our code and model weights are available at https://github.com/baegwangbin/MaGNet.

Cite

Text

Bae et al. "Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00286

Markdown

[Bae et al. "Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/bae2022cvpr-multiview/) doi:10.1109/CVPR52688.2022.00286

BibTeX

@inproceedings{bae2022cvpr-multiview,
  title     = {{Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry}},
  author    = {Bae, Gwangbin and Budvytis, Ignas and Cipolla, Roberto},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {2842-2851},
  doi       = {10.1109/CVPR52688.2022.00286},
  url       = {https://mlanthology.org/cvpr/2022/bae2022cvpr-multiview/}
}