Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection

Abstract

Few-shot object detection has made encouraging progress by reconstructing novel class objects using the feature representation learned upon a set of base classes. However, an implicit contradiction about reconstruction and classification is unfortunately ignored. On the one hand, to precisely reconstruct novel classes, the distributions of base classes should be close to those of novel classes (min-margin). On the other hand, to perform accurate classification, the distributions of either two classes must be far away from each other (max-margin). In this paper, we propose a class margin equilibrium (CME) approach, with the aim to optimize both feature space partition and novel class reconstruction in a systematic way. CME first converts the few-shot detection problem to the few-shot classification problem by using a fully connection layer to decouple localization features. CME then reserves adequate margin space for novel classes by introducing simple-yet-effective class margin loss during feature learning. Finally, CME pursues margin equilibrium by disturbing the features of novel class instances in an adversarial min-max fashion. Experiments on Pascal VOC and MS-COCO datasets show that CME improves two baseline detectors (up to 5% in average), achieving new state-of-the-art performance.

Cite

Text

Li et al. "Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00728

Markdown

[Li et al. "Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/li2021cvpr-beyond/) doi:10.1109/CVPR46437.2021.00728

BibTeX

@inproceedings{li2021cvpr-beyond,
  title     = {{Beyond Max-Margin: Class Margin Equilibrium for Few-Shot Object Detection}},
  author    = {Li, Bohao and Yang, Boyu and Liu, Chang and Liu, Feng and Ji, Rongrong and Ye, Qixiang},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {7363-7372},
  doi       = {10.1109/CVPR46437.2021.00728},
  url       = {https://mlanthology.org/cvpr/2021/li2021cvpr-beyond/}
}