Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection

Abstract

3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent developments in semi-supervised methods seek to mitigate this problem by employing a teacher-student framework to generate pseudo-labels for unlabeled point clouds. However, these pseudo-labels frequently suffer from insufficient diversity and inferior quality. To overcome these hurdles, we introduce an Agent-based Diffusion Model for Semi-supervised 3D Object Detection (Diff3DETR). Specifically, an agent-based object query generator is designed to produce object queries that effectively adapt to dynamic scenes while striking a balance between sampling locations and content embedding. Additionally, a box-aware denoising module utilizes the DDIM denoising process and the long-range attention in the transformer decoder to refine bounding boxes incrementally. Extensive experiments on ScanNet and SUN RGB-D datasets demonstrate that Diff3DETR outperforms state-of-the-art semi-supervised 3D object detection methods.

Cite

Text

Deng et al. "Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72754-2_4

Markdown

[Deng et al. "Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/deng2024eccv-diff3detr/) doi:10.1007/978-3-031-72754-2_4

BibTeX

@inproceedings{deng2024eccv-diff3detr,
  title     = {{Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection}},
  author    = {Deng, Jiacheng and Lu, Jiahao and Zhang, Tianzhu},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72754-2_4},
  url       = {https://mlanthology.org/eccv/2024/deng2024eccv-diff3detr/}
}