Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection

Ren, Zhongzheng; Yu, Zhiding; Yang, Xiaodong; Liu, Ming-Yu; Lee, Yong Jae; Schwing, Alexander G.; Kautz, Jan

doi:10.1109/CVPR42600.2020.01061

Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection

Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Yong Jae Lee, Alexander G. Schwing, Jan Kautz

CVPR 2020

doi:10.1109/CVPR42600.2020.01061 /cvpr/2020/ren2020cvpr-instanceaware/

Abstract

Weakly supervised learning has emerged as a compelling tool for object detection by reducing the need for strong supervision during training. However, major challenges remain: (1) differentiation of object instances can be ambiguous; (2) detectors tend to focus on discriminative parts rather than entire objects; (3) without ground truth, object proposals have to be redundant for high recalls, causing significant memory consumption. Addressing these challenges is difficult, as it often requires to eliminate uncertainties and trivial solutions. To target these issues we develop an instance-aware and context-focused unified framework. It employs an instance-aware self-training algorithm and a learnable Concrete DropBlock while devising a memory-efficient sequential batch back-propagation. Our proposed method achieves state-of-the-art results on COCO (12.1% AP, 24.8% AP50), VOC 2007 (54.9% AP), and VOC 2012 (52.1% AP), improving baselines by great margins. In addition, the proposed method is the first to benchmark ResNet based models and weakly supervised video object detection. Refer to our project page for code, models, and more details: https://github.com/NVlabs/wetectron.

PDF CVPR Semantic Scholar

Cite

Text

Ren et al. "Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01061

Markdown

[Ren et al. "Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/ren2020cvpr-instanceaware/) doi:10.1109/CVPR42600.2020.01061

BibTeX

@inproceedings{ren2020cvpr-instanceaware,
  title     = {{Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection}},
  author    = {Ren, Zhongzheng and Yu, Zhiding and Yang, Xiaodong and Liu, Ming-Yu and Lee, Yong Jae and Schwing, Alexander G. and Kautz, Jan},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01061},
  url       = {https://mlanthology.org/cvpr/2020/ren2020cvpr-instanceaware/}
}