Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency

Abstract

We present an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion, and depth in a monocular camera setup without supervision. Our technical contributions are three-fold. First, we highlight the fundamental difference between inverse and forward projection while modeling the individual motion of each rigid object, and propose a geometrically correct projection pipeline using a neural forward projection module. Second, we design a unified instance-aware photometric and geometric consistency loss that holistically imposes self-supervisory signals for every background and object region. Lastly, we introduce a general-purpose auto-annotation scheme using any off-the-shelf instance segmentation and optical flow models to produce video instance segmentation maps that will be utilized as input to our training pipeline. These proposed elements are validated in a detailed ablation study. Through extensive experiments conducted on the KITTI and Cityscapes dataset, our framework is shown to outperform the state-of-the-art depth and motion estimation methods. Our code, dataset, and models are publicly available.

Cite

Text

Lee et al. "Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I3.16281

Markdown

[Lee et al. "Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/lee2021aaai-learning/) doi:10.1609/AAAI.V35I3.16281

BibTeX

@inproceedings{lee2021aaai-learning,
  title     = {{Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency}},
  author    = {Lee, Seokju and Im, Sunghoon and Lin, Stephen and Kweon, In So},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {1863-1872},
  doi       = {10.1609/AAAI.V35I3.16281},
  url       = {https://mlanthology.org/aaai/2021/lee2021aaai-learning/}
}