Group R-CNN for Weakly Semi-Supervised Object Detection with Points

Zhang, Shilong; Yu, Zhuoran; Liu, Liyang; Wang, Xinjiang; Zhou, Aojun; Chen, Kai

doi:10.1109/CVPR52688.2022.00920

Group R-CNN for Weakly Semi-Supervised Object Detection with Points

Shilong Zhang, Zhuoran Yu, Liyang Liu, Xinjiang Wang, Aojun Zhou, Kai Chen

CVPR 2022 pp. 9417-9426

doi:10.1109/CVPR52688.2022.00920 /cvpr/2022/zhang2022cvpr-group/

Abstract

We study the problem of weakly semi-supervised object detection with points (WSSOD-P), where the training data is combined by a small set of fully annotated images with bounding boxes and a large set of weakly-labeled images with only a single point annotated for each instance. The core of this task is to train a point-to-box regressor on well-labeled images that can be used to predict credible bounding boxes for each point annotation. We challenge the prior belief that existing CNN-based detectors are not compatible with this task. Based on the classic R-CNN architecture, we propose an effective point-to-box regressor: Group R-CNN. Group R-CNN first uses instance-level proposal grouping to generate a group of proposals for each point annotation and thus can obtain a high recall rate. To better distinguish different instances and improve precision, we propose instance-level proposal assignment to replace the vanilla assignment strategy adopted in original R-CNN methods. As naive instance-level assignment brings converging difficulty, we propose instance-aware representation learning which consists of instance-aware feature enhancement and instance-aware parameter generation to overcome this issue. Comprehensive experiments on the MS-COCO benchmark demonstrate the effectiveness of our method. Specifically, Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images, which is the most challenging scenario. The source code can be found at https://github.com/jshilong/GroupRCNN.

PDF CVPR Semantic Scholar

Cite

Text

Zhang et al. "Group R-CNN for Weakly Semi-Supervised Object Detection with Points." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00920

Markdown

[Zhang et al. "Group R-CNN for Weakly Semi-Supervised Object Detection with Points." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/zhang2022cvpr-group/) doi:10.1109/CVPR52688.2022.00920

BibTeX

@inproceedings{zhang2022cvpr-group,
  title     = {{Group R-CNN for Weakly Semi-Supervised Object Detection with Points}},
  author    = {Zhang, Shilong and Yu, Zhuoran and Liu, Liyang and Wang, Xinjiang and Zhou, Aojun and Chen, Kai},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {9417-9426},
  doi       = {10.1109/CVPR52688.2022.00920},
  url       = {https://mlanthology.org/cvpr/2022/zhang2022cvpr-group/}
}