3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

Abstract

3DiffTection introduces a novel method for 3D object detection from single images utilizing a 3D-aware diffusion model for feature extraction. Addressing the resource-intensive nature of annotating large-scale 3D image data our approach leverages pretrained diffusion models traditionally used for 2D tasks and adapts them for 3D detection through geometric and semantic tuning. Geometrically we enhance the model to perform view synthesis from single images incorporating an epipolar warp operator. This process utilizes easily accessible posed image data eliminating the need for manual annotation. Semantically the model is further refined on target detection data. Both stages utilize ControlNet ensuring the preservation of original feature capabilities. Through our methodology we obtain 3D-aware features that excel in identifying cross-view point correspondences. In 3D detection 3DiffTection substantially surpasses previous benchmarks e.g. Cube-RCNN by 9.43% in AP3D on the Omni3D-ARkitscene dataset. Furthermore 3DiffTection demonstrates robust label efficiency and generalizes well to cross-domain data nearly matching fully-supervised models in zero-shot scenarios.

Cite

Text

Xu et al. "3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01010

Markdown

[Xu et al. "3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/xu2024cvpr-3difftection/) doi:10.1109/CVPR52733.2024.01010

BibTeX

@inproceedings{xu2024cvpr-3difftection,
  title     = {{3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features}},
  author    = {Xu, Chenfeng and Ling, Huan and Fidler, Sanja and Litany, Or},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {10617-10627},
  doi       = {10.1109/CVPR52733.2024.01010},
  url       = {https://mlanthology.org/cvpr/2024/xu2024cvpr-3difftection/}
}