CAD: Scale Invariant Framework for Real-Time Object Detection

Abstract

Real-time detection frameworks that typically utilize end-to-end networks to scan the entire vision range, have shown potential effectiveness in object detection. However, compared to more accurate but time-consuming frameworks, detection accuracy of existing real-time networks are still left far behind. Towards this end, this work proposes a novel CAD framework to improve detection accuracy while preserving the real-time speed. Moreover, to enhance the generalization ability of the proposed framework, we introduce maxout [1] to approximate the correlation between image pixels and network predictions. In addition, the non-maximum weighted (NMW) [2] is employed to eliminate the redundant bounding boxes that are considered as repetitive detections for the same objects. Extensive experiments are conducted on two detection benchmarks to demonstrate that the proposed framework achieves state-of-the-art performance.

Cite

Text

Zhou et al. "CAD: Scale Invariant Framework for Real-Time Object Detection." IEEE/CVF International Conference on Computer Vision Workshops, 2017. doi:10.1109/ICCVW.2017.95

Markdown

[Zhou et al. "CAD: Scale Invariant Framework for Real-Time Object Detection." IEEE/CVF International Conference on Computer Vision Workshops, 2017.](https://mlanthology.org/iccvw/2017/zhou2017iccvw-cad/) doi:10.1109/ICCVW.2017.95

BibTeX

@inproceedings{zhou2017iccvw-cad,
  title     = {{CAD: Scale Invariant Framework for Real-Time Object Detection}},
  author    = {Zhou, Huajun and Li, Zechao and Ning, Chengcheng and Tang, Jinhui},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2017},
  pages     = {760-768},
  doi       = {10.1109/ICCVW.2017.95},
  url       = {https://mlanthology.org/iccvw/2017/zhou2017iccvw-cad/}
}