W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection

Zhang, Yongqiang; Bai, Yancheng; Ding, Mingli; Li, Yongqiang; Ghanem, Bernard

doi:10.1109/CVPR.2018.00103

W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection

Yongqiang Zhang, Yancheng Bai, Mingli Ding, Yongqiang Li, Bernard Ghanem

CVPR 2018

doi:10.1109/CVPR.2018.00103 /cvpr/2018/zhang2018cvpr-w2f/

Abstract

Weakly-supervised object detection has attracted much attention lately, since it does not require bounding box annotations for training. Although significant progress has also been made, there is still a large gap in performance between weakly-supervised and fully-supervised object detection. Recently, some works use pseudo ground-truths which are generated by a weakly-supervised detector to train a supervised detector. Such approaches incline to find the most representative parts of objects, and only seek one ground-truth box per class even though many same-class instances exist. To overcome these issues, we propose a weakly-supervised to fully-supervised framework, where a weakly-supervised detector is implemented using multiple instance learning. Then, we propose a pseudo ground-truth excavation (PGE) algorithm to find the pseudo ground-truth of each instance in the image. Moreover, the pseudo ground-truth adaptation (PGA) algorithm is designed to further refine the pseudo ground-truths from PGE. Finally, we use these pseudo ground-truths to train a fully-supervised detector. Extensive experiments on the challenging PASCAL VOC 2007 and 2012 benchmarks strongly demonstrate the effectiveness of our framework. We obtain 52.4% and 47.8% mAP on VOC2007 and VOC2012 respectively, a significant improvement over previous state-of-the-art methods.

PDF CVPR Semantic Scholar

Cite

Text

Zhang et al. "W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00103

Markdown

[Zhang et al. "W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/zhang2018cvpr-w2f/) doi:10.1109/CVPR.2018.00103

BibTeX

@inproceedings{zhang2018cvpr-w2f,
  title     = {{W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection}},
  author    = {Zhang, Yongqiang and Bai, Yancheng and Ding, Mingli and Li, Yongqiang and Ghanem, Bernard},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00103},
  url       = {https://mlanthology.org/cvpr/2018/zhang2018cvpr-w2f/}
}