Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth
Abstract
Conventional self-supervised monocular depth prediction methods are based on a static environment assumption, which leads to accuracy degradation in dynamic scenes due to the mismatch and occlusion problems introduced by object motions. Existing dynamic-object-focused methods only partially solved the mismatch problem at the training loss level. In this paper, we accordingly propose a novel multi-frame monocular depth prediction method to solve these problems at both the prediction and supervision loss levels. Our method, called DynamicDepth, is a new framework trained via a self-supervised cycle consistent learning scheme. A Dynamic Object Motion Disentanglement (DOMD) module is proposed to disentangle object motions to solve the mismatch problem. Moreover, novel occlusion-aware Cost Volume and Re-projection Loss are designed to alleviate the occlusion effects of object motions. Extensive analyses and experiments on the Cityscapes and KITTI datasets show that our method significantly outperforms the state-of-the-art monocular depth prediction methods, especially in the areas of dynamic objects. Code is available at https://github.com/AutoAILab/DynamicDepth
Cite
Text
Feng et al. "Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19824-3_14Markdown
[Feng et al. "Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/feng2022eccv-disentangling/) doi:10.1007/978-3-031-19824-3_14BibTeX
@inproceedings{feng2022eccv-disentangling,
title = {{Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth}},
author = {Feng, Ziyue and Yang, Liang and Jing, Longlong and Wang, Haiyan and Tian, YingLi and Li, Bing},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19824-3_14},
url = {https://mlanthology.org/eccv/2022/feng2022eccv-disentangling/}
}