Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo

Abstract

Depth estimation from multi-view stereo images is one of the most fundamental and essential tasks in understanding a scene imaginary. In this paper, we propose a machine learning technique based on deep convolutional neural networks (CNNs) for multi-view stereo matching. The proposed method measures the matching cost to extract depth values between two-view stereo images among multi-view stereo images using a deep architecture. Moreover, we present the confidence estimation network for incorporating the cost volumes along the depth hypothesis in multiview stereo. Experiments show that our estimated depth map from multiple views shows the better performance than the other matching similarity measure on DTU dataset.

Cite

Text

Choi et al. "Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018. doi:10.1109/CVPRW.2018.00065

Markdown

[Choi et al. "Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2018.](https://mlanthology.org/cvprw/2018/choi2018cvprw-learning/) doi:10.1109/CVPRW.2018.00065

BibTeX

@inproceedings{choi2018cvprw-learning,
  title     = {{Learning Descriptor, Confidence, and Depth Estimation in Multi-View Stereo}},
  author    = {Choi, Sungil and Kim, Seungryong and Park, Kihong and Sohn, Kwanghoon},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2018},
  pages     = {276-282},
  doi       = {10.1109/CVPRW.2018.00065},
  url       = {https://mlanthology.org/cvprw/2018/choi2018cvprw-learning/}
}