MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera
Abstract
In this paper, we propose MonoRec, a semi-supervised monocular dense reconstruction architecture that predicts depth maps from a single moving camera in dynamic environments. MonoRec is based on a multi-view stereo setting which encodes the information of multiple consecutive images in a cost volume. To deal with dynamic objects in the scene, we introduce a MaskModule that predicts moving object masks by leveraging the photometric inconsistencies encoded in the cost volumes. Unlike other multi-view stereo methods, MonoRec is able to reconstruct both static and moving objects by leveraging the predicted masks. Furthermore, we present a novel multi-stage training scheme with a semi-supervised loss formulation that does not require LiDAR depth values. We carefully evaluate MonoRec on the KITTI dataset and show that it achieves state-of-the-art performance compared to both multi-view and single-view methods. With the model trained on KITTI, we further demonstrate that MonoRec is able to generalize well to both the Oxford RobotCar dataset and the more challenging TUM-Mono dataset recorded by a handheld camera. Code and related materials are available at https://vision.in.tum.de/research/monorec.
Cite
Text
Wimbauer et al. "MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00605Markdown
[Wimbauer et al. "MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/wimbauer2021cvpr-monorec/) doi:10.1109/CVPR46437.2021.00605BibTeX
@inproceedings{wimbauer2021cvpr-monorec,
title = {{MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera}},
author = {Wimbauer, Felix and Yang, Nan and von Stumberg, Lukas and Zeller, Niclas and Cremers, Daniel},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {6112-6122},
doi = {10.1109/CVPR46437.2021.00605},
url = {https://mlanthology.org/cvpr/2021/wimbauer2021cvpr-monorec/}
}