CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds

Abstract

We present a novel two-stage fully sparse convolutional 3D object detection framework, named CAGroup3D. Our proposed method first generates some high-quality 3D proposals by leveraging the class-aware local group strategy on the object surface voxels with the same semantic predictions, which considers semantic consistency and diverse locality abandoned in previous bottom-up approaches. Then, to recover the features of missed voxels due to incorrect voxel-wise segmentation, we build a fully sparse convolutional RoI pooling module to directly aggregate fine-grained spatial information from backbone for further proposal refinement. It is memory-and-computation efficient and can better encode the geometry-specific features of each 3D proposal. Our model achieves state-of-the-art 3D detection performance with remarkable gains of +3.6% on ScanNet V2 and +2.6% on SUN RGB-D in term of [email protected]. Code will be available at https://github.com/Haiyang-W/CAGroup3D.

Cite

Text

Wang et al. "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds." Neural Information Processing Systems, 2022.

Markdown

[Wang et al. "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/wang2022neurips-cagroup3d/)

BibTeX

@inproceedings{wang2022neurips-cagroup3d,
  title     = {{CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds}},
  author    = {Wang, Haiyang and Ding, Lihe and Dong, Shaocong and Shi, Shaoshuai and Li, Aoxue and Li, Jianan and Li, Zhenguo and Wang, Liwei},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/wang2022neurips-cagroup3d/}
}