MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

Abstract

In this paper, we propose MonoRec, a semi-supervised monocular dense reconstruction architecture that predicts depth maps from a single moving camera in dynamic environments. MonoRec is based on a multi-view stereo setting which encodes the information of multiple consecutive images in a cost volume. To deal with dynamic objects in the scene, we introduce a MaskModule that predicts moving object masks by leveraging the photometric inconsistencies encoded in the cost volumes. Unlike other multi-view stereo methods, MonoRec is able to reconstruct both static and moving objects by leveraging the predicted masks. Furthermore, we present a novel multi-stage training scheme with a semi-supervised loss formulation that does not require LiDAR depth values. We carefully evaluate MonoRec on the KITTI dataset and show that it achieves state-of-the-art performance compared to both multi-view and single-view methods. With the model trained on KITTI, we further demonstrate that MonoRec is able to generalize well to both the Oxford RobotCar dataset and the more challenging TUM-Mono dataset recorded by a handheld camera. Code and related materials are available at https://vision.in.tum.de/research/monorec.

Cite

Text

Wimbauer et al. "MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00605

Markdown

[Wimbauer et al. "MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/wimbauer2021cvpr-monorec/) doi:10.1109/CVPR46437.2021.00605

BibTeX

@inproceedings{wimbauer2021cvpr-monorec,
  title     = {{MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera}},
  author    = {Wimbauer, Felix and Yang, Nan and von Stumberg, Lukas and Zeller, Niclas and Cremers, Daniel},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {6112-6122},
  doi       = {10.1109/CVPR46437.2021.00605},
  url       = {https://mlanthology.org/cvpr/2021/wimbauer2021cvpr-monorec/}
}