ReDiffDet: Rotation-Equivariant Diffusion Model for Oriented Object Detection

Abstract

Diffusion models have been successfully applied to various detection tasks. However, they still face three challenges in oriented object detection: arbitrarily rotated objects require the model to encode orientation information; uncontrollable random boxes localize densely arranged objects and objects with extreme aspect ratios inaccurately; and oriented boxes are misaligned with image features. To overcome these limitations, we propose ReDiffDet, a framework that formulates oriented object detection as a rotation-equivariant denoising diffusion process. First, we represent an oriented box as a 2D Gaussian distribution, which forms the basis of the denoising paradigm. Under this representation and model framework, the reverse process can be proven to be rotation-equivariant. Second, we design a conditional encoder with conditional boxes to prevent boxes from being randomly placed across the entire image. Third, we propose an aligned decoder that aligns oriented boxes with image features. Extensive experiments demonstrate that ReDiffDet achieves promising performance and significantly outperforms the diffusion-based baseline detector.
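
The box-to-Gaussian representation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact parameterization, so the code below assumes the convention commonly used in Gaussian-based oriented detection losses: the mean is the box center and the covariance is R diag(w²/4, h²/4) Rᵀ, where R is the rotation by the box angle. The function names `obb_to_gaussian` and `rotate_gaussian` are hypothetical and are not taken from the paper. The sketch only shows why the representation itself transforms consistently under rotation; it does not reproduce the paper's proof for the reverse diffusion process.

```python
import math


def obb_to_gaussian(cx, cy, w, h, theta):
    """Map an oriented box (center, width, height, angle in radians)
    to the mean and covariance of a 2D Gaussian.

    Assumed convention (not confirmed by the abstract):
    covariance = R(theta) @ diag(w^2/4, h^2/4) @ R(theta)^T.
    """
    c, s = math.cos(theta), math.sin(theta)
    a, b = (w / 2.0) ** 2, (h / 2.0) ** 2
    sxx = a * c * c + b * s * s
    sxy = (a - b) * c * s
    syy = a * s * s + b * c * c
    return (cx, cy), ((sxx, sxy), (sxy, syy))


def rotate_gaussian(mean, cov, phi):
    """Rotate a 2D Gaussian about the origin by phi radians:
    mean -> R(phi) mean, covariance -> R(phi) cov R(phi)^T."""
    c, s = math.cos(phi), math.sin(phi)
    mx, my = mean
    (sxx, sxy), (_, syy) = cov
    new_mean = (c * mx - s * my, s * mx + c * my)
    nxx = c * c * sxx - 2.0 * c * s * sxy + s * s * syy
    nxy = c * s * (sxx - syy) + (c * c - s * s) * sxy
    nyy = s * s * sxx + 2.0 * c * s * sxy + c * c * syy
    return new_mean, ((nxx, nxy), (nxy, nyy))


def gaussians_close(g1, g2, tol=1e-9):
    """Compare two (mean, covariance) pairs element-wise."""
    (m1, c1), (m2, c2) = g1, g2
    flat1 = list(m1) + [v for row in c1 for v in row]
    flat2 = list(m2) + [v for row in c2 for v in row]
    return all(abs(x - y) < tol for x, y in zip(flat1, flat2))


if __name__ == "__main__":
    cx, cy, w, h, theta, phi = 10.0, 5.0, 8.0, 2.0, 0.3, 0.7
    # Path 1: convert the box, then rotate the Gaussian.
    rotated_after = rotate_gaussian(*obb_to_gaussian(cx, cy, w, h, theta), phi)
    # Path 2: rotate the box about the origin, then convert.
    c, s = math.cos(phi), math.sin(phi)
    rotated_before = obb_to_gaussian(c * cx - s * cy, s * cx + c * cy, w, h, theta + phi)
    print(gaussians_close(rotated_after, rotated_before))
```

Under this assumed convention, rotating the box and then converting gives the same Gaussian as converting and then rotating, which is the equivariance property at the level of the representation. One known limitation of the convention is that a square box (w = h) yields an isotropic covariance, so its angle cannot be recovered from the Gaussian.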

Cite

Text

Zhao et al. "ReDiffDet: Rotation-Equivariant Diffusion Model for Oriented Object Detection." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.02275

Markdown

[Zhao et al. "ReDiffDet: Rotation-Equivariant Diffusion Model for Oriented Object Detection." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/zhao2025cvpr-rediffdet/) doi:10.1109/CVPR52734.2025.02275

BibTeX

@inproceedings{zhao2025cvpr-rediffdet,
  title     = {{ReDiffDet: Rotation-Equivariant Diffusion Model for Oriented Object Detection}},
  author    = {Zhao, Jiaqi and Ding, Zeyu and Zhou, Yong and Zhu, Hancheng and Du, Wen-Liang and Yao, Rui},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {24429-24439},
  doi       = {10.1109/CVPR52734.2025.02275},
  url       = {https://mlanthology.org/cvpr/2025/zhao2025cvpr-rediffdet/}
}