PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation
Abstract
This paper presents a unified framework for depth-aware panoptic segmentation (DPS), which aims to reconstruct 3D scene with instance-level semantics from one single image. Prior works address this problem by simply adding a dense depth regression head to panoptic segmentation (PS) networks, resulting in two independent task branches. This neglects the mutually-beneficial relations between these two tasks, thus failing to exploit handy instance-level semantic cues to boost depth accuracy while also producing sub-optimal depth maps. To overcome these limitations, we propose a unified framework for the DPS task by applying a dynamic convolution technique to both the PS and depth prediction tasks. Specifically, instead of predicting depth for all pixels at a time, we generate instance-specific kernels to predict depth and segmentation masks for each instance. Moreover, leveraging the instance-wise depth estimation scheme, we add additional instance-level depth cues to assist with supervising the depth learning via a new depth loss. Extensive experiments on Cityscapes-DPS and SemKITTI-DPS show the effectiveness and promise of our method. We hope our unified solution to DPS can lead a new paradigm in this area.
Cite
Text
Gao et al. "PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00168Markdown
[Gao et al. "PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/gao2022cvpr-panopticdepth/) doi:10.1109/CVPR52688.2022.00168BibTeX
@inproceedings{gao2022cvpr-panopticdepth,
title = {{PanopticDepth: A Unified Framework for Depth-Aware Panoptic Segmentation}},
author = {Gao, Naiyu and He, Fei and Jia, Jian and Shan, Yanhu and Zhang, Haoyang and Zhao, Xin and Huang, Kaiqi},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {1632-1642},
doi = {10.1109/CVPR52688.2022.00168},
url = {https://mlanthology.org/cvpr/2022/gao2022cvpr-panopticdepth/}
}