Normal Assisted Stereo Depth Estimation

Abstract

Accurate stereo depth estimation plays a critical role in various 3D tasks in both indoor and outdoor environments. Recently, learning-based multi-view stereo methods have demonstrated competitive performance with limited number of views. However, in challenging scenarios, especially when building cross-view correspondences is hard, these methods still cannot produce satisfying results. In this paper, we study how to enforce the consistency between surface normal and depth at training time to improve the performance. We couple the learning of a multi-view normal estimation module and a multi-view depth estimation module. In addition, we propose a novel consistency loss to train an independent consistency module that refines the depths from depth/normal pairs. We find that the joint learning can improve both the prediction of normal and depth, and the accuracy and smoothness can be further improved by enforcing the consistency. Experiments on MVS, SUN3D, RGBD and Scenes11 demonstrate the effectiveness of our method and state-of-the-art performance.

Cite

Text

Kusupati et al. "Normal Assisted Stereo Depth Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00226

Markdown

[Kusupati et al. "Normal Assisted Stereo Depth Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/kusupati2020cvpr-normal/) doi:10.1109/CVPR42600.2020.00226

BibTeX

@inproceedings{kusupati2020cvpr-normal,
  title     = {{Normal Assisted Stereo Depth Estimation}},
  author    = {Kusupati, Uday and Cheng, Shuo and Chen, Rui and Su, Hao},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00226},
  url       = {https://mlanthology.org/cvpr/2020/kusupati2020cvpr-normal/}
}