ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot

Abstract

One-stage long-tailed recognition methods improve the overall performance in a "seesaw" manner, i.e., either sacrifice the head's accuracy for better tail classification or elevate the head's accuracy even higher but ignore the tail. Existing algorithms bypass such trade-off by a multi-stage training process: pre-training on imbalanced set and fine-tuning on balanced set. Though achieving promising performance, not only are they sensitive to the generalizability of the pre-trained model, but also not easily integrated into other computer vision tasks like detection and segmentation, where pre-training of classifier solely is not applicable. In this paper, we propose a one-stage long-tailed recognition scheme, ally complementary experts (ACE), where the expert is the most knowledgeable specialist in a sub-set that dominates its training, and is complementary to other experts in the less-seen categories without disturbed by what it has never seen. We design a distribution-adaptive optimizer to adjust the learning pace of each expert to avoid over-fitting. Without special bells and whistles, the vanilla ACE outperforms the current one-stage SOTA method by 3 10% on CIFAR10-LT, CIFAR100-LT, ImageNet-LT and iNaturalist datasets. It is also shown to be the first one to break the "seesaw" trade-off by improving the accuracy of the majority and minority categories simultaneously in only one stage.

Cite

Text

Cai et al. "ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00018

Markdown

[Cai et al. "ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/cai2021iccv-ace/) doi:10.1109/ICCV48922.2021.00018

BibTeX

@inproceedings{cai2021iccv-ace,
  title     = {{ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot}},
  author    = {Cai, Jiarui and Wang, Yizhou and Hwang, Jenq-Neng},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {112-121},
  doi       = {10.1109/ICCV48922.2021.00018},
  url       = {https://mlanthology.org/iccv/2021/cai2021iccv-ace/}
}