Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection
Abstract
3D object detection is essential for understanding 3D scenes. Contemporary techniques often require extensive annotated training data, yet obtaining point-wise annotations for point clouds is time-consuming and laborious. Recent developments in semi-supervised methods seek to mitigate this problem by employing a teacher-student framework to generate pseudo-labels for unlabeled point clouds. However, these pseudo-labels frequently suffer from insufficient diversity and inferior quality. To overcome these hurdles, we introduce an Agent-based Diffusion Model for Semi-supervised 3D Object Detection (Diff3DETR). Specifically, an agent-based object query generator is designed to produce object queries that effectively adapt to dynamic scenes while striking a balance between sampling locations and content embedding. Additionally, a box-aware denoising module utilizes the DDIM denoising process and the long-range attention in the transformer decoder to refine bounding boxes incrementally. Extensive experiments on ScanNet and SUN RGB-D datasets demonstrate that Diff3DETR outperforms state-of-the-art semi-supervised 3D object detection methods.
Cite
Text
Deng et al. "Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72754-2_4Markdown
[Deng et al. "Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/deng2024eccv-diff3detr/) doi:10.1007/978-3-031-72754-2_4BibTeX
@inproceedings{deng2024eccv-diff3detr,
title = {{Diff3DETR: Agent-Based Diffusion Model for Semi-Supervised 3D Object Detection}},
author = {Deng, Jiacheng and Lu, Jiahao and Zhang, Tianzhu},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72754-2_4},
url = {https://mlanthology.org/eccv/2024/deng2024eccv-diff3detr/}
}