Learning Non-Volumetric Depth Fusion Using Successive Reprojections

Abstract

Given a set of input views, multi-view stereopsis techniques estimate depth maps to represent the 3D reconstruction of the scene; these are fused into a single, consistent, reconstruction -- most often a point cloud. In this work we propose to learn an auto-regressive depth refinement directly from data. While deep learning has improved the accuracy and speed of depth estimation significantly, learned MVS techniques remain limited to the planesweeping paradigm. We refine a set of input depth maps by successively reprojecting information from neighbouring views to leverage multi-view constraints. Compared to learning-based volumetric fusion techniques, an image-based representation allows significantly more detailed reconstructions; compared to traditional point-based techniques, our method learns noise suppression and surface completion in a data-driven fashion. Due to the limited availability of high-quality reconstruction datasets with ground truth, we introduce two novel synthetic datasets to (pre-)train our network. Our approach is able to improve both the output depth maps and the reconstructed point cloud, for both learned and traditional depth estimation front-ends, on both synthetic and real data.

Cite

Text

Donne and Geiger. "Learning Non-Volumetric Depth Fusion Using Successive Reprojections." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00782

Markdown

[Donne and Geiger. "Learning Non-Volumetric Depth Fusion Using Successive Reprojections." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/donne2019cvpr-learning/) doi:10.1109/CVPR.2019.00782

BibTeX

@inproceedings{donne2019cvpr-learning,
  title     = {{Learning Non-Volumetric Depth Fusion Using Successive Reprojections}},
  author    = {Donne, Simon and Geiger, Andreas},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00782},
  url       = {https://mlanthology.org/cvpr/2019/donne2019cvpr-learning/}
}