SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation

Abstract

Monocular depth estimation is an ill-posed problem, and as such critically relies on scene priors and semantics. Due to its complexity, we propose a deep neural network model based on a semantic divide-and-conquer approach. Our model decomposes a scene into semantic segments, such as object instances and background stuff classes, and then predicts a scale and shift invariant depth map for each semantic segment in a canonical space. Semantic segments of the same category share the same depth decoder, so the global depth prediction task is decomposed into a series of category-specific ones, which are simpler to learn and easier to generalize to new scene types. Finally, our model stitches each local depth segment by predicting its scale and shift based on the global context of the image. The model is trained end-to-end using a multi-task loss for panoptic segmentation and depth prediction, and is therefore able to leverage large-scale panoptic segmentation datasets to boost its semantic understanding. We validate the effectiveness of our approach and show state-of-the-art performance on three benchmark datasets.

Cite

Text

Wang et al. "SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00062

Markdown

[Wang et al. "SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/wang2020cvpr-sdcdepth/) doi:10.1109/CVPR42600.2020.00062

BibTeX

@inproceedings{wang2020cvpr-sdcdepth,
  title     = {{SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation}},
  author    = {Wang, Lijun and Zhang, Jianming and Wang, Oliver and Lin, Zhe and Lu, Huchuan},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00062},
  url       = {https://mlanthology.org/cvpr/2020/wang2020cvpr-sdcdepth/}
}