Joint 3D Instance Segmentation and Object Detection for Autonomous Driving

Zhou, Dingfu; Fang, Jin; Song, Xibin; Liu, Liu; Yin, Junbo; Dai, Yuchao; Li, Hongdong; Yang, Ruigang

doi:10.1109/CVPR42600.2020.00191

Joint 3D Instance Segmentation and Object Detection for Autonomous Driving

Dingfu Zhou, Jin Fang, Xibin Song, Liu Liu, Junbo Yin, Yuchao Dai, Hongdong Li, Ruigang Yang

CVPR 2020

doi:10.1109/CVPR42600.2020.00191 /cvpr/2020/zhou2020cvpr-joint/

Abstract

Currently, in Autonomous Driving (AD), most of the 3D object detection frameworks (either anchor- or anchor-free-based) consider the detection as a Bounding Box (BBox) regression problem. However, this compact representation is not sufficient to explore all the information of the objects. To tackle this problem, we propose a simple but practical detection framework to jointly predict the 3D BBox and instance segmentation. For instance segmentation, we propose a Spatial Embeddings (SEs) strategy to assemble all foreground points into their corresponding object centers. Base on the SE results, the object proposals can be generated based on a simple clustering strategy. For each cluster, only one proposal is generated. Therefore, the Non-Maximum Suppression (NMS) process is no longer needed here. Finally, with our proposed instance-aware ROI pooling, the BBox is refined by a second-stage network. Experimental results on the public KITTI dataset show that the proposed SEs can significantly improve the instance segmentation results compared with other feature embedding-based method. Meanwhile, it also outperforms most of the 3D object detectors on the KITTI testing benchmark.

PDF CVPR Semantic Scholar

Cite

Text

Zhou et al. "Joint 3D Instance Segmentation and Object Detection for Autonomous Driving." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00191

Markdown

[Zhou et al. "Joint 3D Instance Segmentation and Object Detection for Autonomous Driving." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/zhou2020cvpr-joint/) doi:10.1109/CVPR42600.2020.00191

BibTeX

@inproceedings{zhou2020cvpr-joint,
  title     = {{Joint 3D Instance Segmentation and Object Detection for Autonomous Driving}},
  author    = {Zhou, Dingfu and Fang, Jin and Song, Xibin and Liu, Liu and Yin, Junbo and Dai, Yuchao and Li, Hongdong and Yang, Ruigang},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00191},
  url       = {https://mlanthology.org/cvpr/2020/zhou2020cvpr-joint/}
}