Theoretically Achieving Continuous Representation of Oriented Bounding Boxes

Abstract

Considerable efforts have been devoted to Oriented Object Detection (OOD). However, one lasting issue regarding the discontinuity in Oriented Bounding Box (OBB) representation remains unresolved, which is an inherent bottleneck for extant OOD methods. This paper endeavors to completely solve this issue in a theoretically guaranteed manner and puts an end to the ad-hoc efforts in this direction. Prior studies typically can only address one of the two cases of discontinuity, rotation and aspect ratio, and often inadvertently introduce decoding discontinuity, e.g., Decoding Incompleteness (DI) and Decoding Ambiguity (DA), as discussed in the literature. Specifically, we propose a novel representation method called Continuous OBB (COBB), which can be readily integrated into existing detectors, e.g., Faster-RCNN, as a plugin. It can theoretically ensure continuity in bounding box regression, which, to the best of our knowledge, has not been achieved in the literature for rectangle-based object representation. For fairness and transparency of experiments, we have developed a modularized benchmark for OOD evaluation based on JDet, the detection toolbox of the open-source deep learning framework Jittor. On the popular DOTA dataset, with Faster-RCNN as the shared baseline model, our new method outperforms the peer method Gliding Vertex by 1.13% mAP50 (relative improvement 1.54%) and 2.46% mAP75 (relative improvement 5.91%) without any tricks.
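The discontinuity mentioned in the abstract can be illustrated with a small sketch. It is not the COBB method. It assumes a common (cx, cy, w, h, theta) parametrization, which the abstract does not specify, and the helper names `obb_corners` and `same_box` are hypothetical. The sketch shows that parameter vectors far apart in value can describe the same rectangle, which is why a regressor constrained to one canonical range may face abrupt target jumps for nearly identical boxes.

```python
import math

def obb_corners(cx, cy, w, h, theta):
    """Corners of an oriented box; theta is in radians (assumed convention)."""
    c, s = math.cos(theta), math.sin(theta)
    offsets = ((w / 2, h / 2), (-w / 2, h / 2), (-w / 2, -h / 2), (w / 2, -h / 2))
    # Rotate each half-extent offset by theta, then translate to the center.
    return [(cx + dx * c - dy * s, cy + dx * s + dy * c) for dx, dy in offsets]

def same_box(a, b, tol=1e-9):
    """True if every corner in a coincides with some corner in b."""
    return all(any(math.dist(p, q) < tol for q in b) for p in a)

theta = 0.3
params_a = (0.0, 0.0, 4.0, 2.0, theta)                # (cx, cy, w, h, theta)
params_b = (0.0, 0.0, 2.0, 4.0, theta + math.pi / 2)  # w and h swapped, +90 degrees
params_c = (0.0, 0.0, 4.0, 2.0, theta + math.pi)      # same w and h, +180 degrees

# All three parameter vectors describe one and the same rectangle.
print(same_box(obb_corners(*params_a), obb_corners(*params_b)))
print(same_box(obb_corners(*params_a), obb_corners(*params_c)))
```

Because several parameter vectors map to one box, any scheme that picks a single canonical vector has to break ties somewhere, and boxes on either side of that tie-breaking boundary receive very different regression targets.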

Cite

Text

Xiao et al. "Theoretically Achieving Continuous Representation of Oriented Bounding Boxes." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01600

Markdown

[Xiao et al. "Theoretically Achieving Continuous Representation of Oriented Bounding Boxes." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/xiao2024cvpr-theoretically/) doi:10.1109/CVPR52733.2024.01600

BibTeX

@inproceedings{xiao2024cvpr-theoretically,
  title     = {{Theoretically Achieving Continuous Representation of Oriented Bounding Boxes}},
  author    = {Xiao, Zikai and Yang, Guoye and Yang, Xue and Mu, Taijiang and Yan, Junchi and Hu, Shimin},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {16912-16922},
  doi       = {10.1109/CVPR52733.2024.01600},
  url       = {https://mlanthology.org/cvpr/2024/xiao2024cvpr-theoretically/}
}