Diffusion-SS3D: Diffusion Model for Semi-Supervised 3D Object Detection

Abstract

Semi-supervised object detection is crucial for 3D scene understanding, efficiently addressing the limitation of acquiring large-scale 3D bounding box annotations. Existing methods typically employ a teacher-student framework with pseudo-labeling to leverage unlabeled point clouds. However, producing reliable pseudo-labels in a diverse 3D space still remains challenging. In this work, we propose Diffusion-SS3D, a new perspective of enhancing the quality of pseudo-labels via the diffusion model for semi-supervised 3D object detection. Specifically, we include noises to produce corrupted 3D object size and class label distributions, and then utilize the diffusion model as a denoising process to obtain bounding box outputs. Moreover, we integrate the diffusion model into the teacher-student framework, so that the denoised bounding boxes can be used to improve pseudo-label generation, as well as the entire semi-supervised learning process. We conduct experiments on the ScanNet and SUN RGB-D benchmark datasets to demonstrate that our approach achieves state-of-the-art performance against existing methods. We also present extensive analysis to understand how our diffusion model design affects performance in semi-supervised learning. The source code will be available at https://github.com/luluho1208/Diffusion-SS3D.

Cite

Text

Ho et al. "Diffusion-SS3D: Diffusion Model for Semi-Supervised 3D Object Detection." Neural Information Processing Systems, 2023.

Markdown

[Ho et al. "Diffusion-SS3D: Diffusion Model for Semi-Supervised 3D Object Detection." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/ho2023neurips-diffusionss3d/)

BibTeX

@inproceedings{ho2023neurips-diffusionss3d,
  title     = {{Diffusion-SS3D: Diffusion Model for Semi-Supervised 3D Object Detection}},
  author    = {Ho, Cheng-Ju and Tai, Chen-Hsuan and Lin, Yen-Yu and Yang, Ming-Hsuan and Tsai, Yi-Hsuan},
  booktitle = {Neural Information Processing Systems},
  year      = {2023},
  url       = {https://mlanthology.org/neurips/2023/ho2023neurips-diffusionss3d/}
}