PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation

Abstract

This paper presents a unified framework for depth-aware panoptic segmentation (DPS), which aims to reconstruct 3D scene with instance-level semantics from one single image. Prior works address this problem by simply adding a dense depth regression head to panoptic segmentation (PS) networks, resulting in two independent task branches. This neglects the mutually-beneficial relations between these two tasks, thus failing to exploit handy instance-level semantic cues to boost depth accuracy while also producing sub-optimal depth maps. To overcome these limitations, we propose a unified framework for the DPS task by applying a dynamic convolution technique to both the PS and depth prediction tasks. Specifically, instead of predicting depth for all pixels at a time, we generate instance-specific kernels to predict depth and segmentation masks for each instance. Moreover, leveraging the instance-wise depth estimation scheme, we add additional instance-level depth cues to assist with supervising the depth learning via a new depth loss. Extensive experiments on Cityscapes-DPS and SemKITTI-DPS show the effectiveness and promise of our method. We hope our unified solution to DPS can lead a new paradigm in this area.

Cite

Text

Gao et al. "PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00168

Markdown

[Gao et al. "PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/gao2022cvpr-panopticdepth/) doi:10.1109/CVPR52688.2022.00168

BibTeX

@inproceedings{gao2022cvpr-panopticdepth,
  title     = {{PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation}},
  author    = {Gao, Naiyu and He, Fei and Jia, Jian and Shan, Yanhu and Zhang, Haoyang and Zhao, Xin and Huang, Kaiqi},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {1632-1642},
  doi       = {10.1109/CVPR52688.2022.00168},
  url       = {https://mlanthology.org/cvpr/2022/gao2022cvpr-panopticdepth/}
}