FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding

Abstract

Emerging interests have been brought to recognize previously unseen objects given very few training examples, known as few-shot object detection (FSOD). Recent researches demonstrate that good feature embedding is the key to reach favorable few-shot learning performance. We observe object proposals with different Intersection-of-Union (IoU) scores are analogous to the intra-image augmentation used in contrastive visual representation learning. And we exploit this analogy and incorporate supervised contrastive learning to achieve more robust objects representations in FSOD. We present Few-Shot object detection via Contrastive proposals Encoding (FSCE), a simple yet effective approach to learning contrastive-aware object proposal encodings that facilitate the classification of detected objects. We notice the degradation of average precision (AP) for rare objects mainly comes from misclassifying novel instances as confusable classes. And we ease the misclassification issues by promoting instance level intra-class compactness and inter-class variance via our contrastive proposal encoding loss (CPE loss). Our design outperforms current state-of-the-art works in any shot and all data splits, with up to +8.8% on standard benchmark PASCAL VOC and +2.7% on challenging COCO benchmark. Code is available at: https://github.com/ MegviiDetection/FSCE.

Cite

Text

Sun et al. "FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00727

Markdown

[Sun et al. "FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/sun2021cvpr-fsce/) doi:10.1109/CVPR46437.2021.00727

BibTeX

@inproceedings{sun2021cvpr-fsce,
  title     = {{FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding}},
  author    = {Sun, Bo and Li, Banghuai and Cai, Shengcai and Yuan, Ye and Zhang, Chi},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {7352-7362},
  doi       = {10.1109/CVPR46437.2021.00727},
  url       = {https://mlanthology.org/cvpr/2021/sun2021cvpr-fsce/}
}