Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
Abstract
In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images. Many recent works solve this problem by first recovering a point cloud with disparity estimation and then apply a 3D detector. The disparity map is computed for the entire image, which is costly and fails to leverage category-specific prior. In contrast, we design an instance disparity estimation network (iDispNet) that predicts disparity only for pixels on objects of interest and learns a category-specific shape prior for more accurate disparity estimation. To address the challenge from scarcity of disparity annotation in training, we propose to use a statistical shape model to generate dense disparity pseudo-ground-truth without the need of LiDAR point clouds, which makes our system more widely applicable. Experiments on the KITTI dataset show that, even when LiDAR ground-truth is not available at training time, Disp R-CNN achieves competitive performance and outperforms previous state-of-the-art methods by 20% in terms of average precision. The code will be available at https://github.com/zju3dv/disprcnn.
Cite
Text
Sun et al. "Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01056Markdown
[Sun et al. "Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/sun2020cvpr-disp/) doi:10.1109/CVPR42600.2020.01056BibTeX
@inproceedings{sun2020cvpr-disp,
title = {{Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation}},
author = {Sun, Jiaming and Chen, Linghao and Xie, Yiming and Zhang, Siyu and Jiang, Qinhong and Zhou, Xiaowei and Bao, Hujun},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2020},
doi = {10.1109/CVPR42600.2020.01056},
url = {https://mlanthology.org/cvpr/2020/sun2020cvpr-disp/}
}