A Baseline for Few-Shot Image Classification

Abstract

Fine-tuning a deep network trained with the standard cross-entropy loss is a strong baseline for few-shot learning. When fine-tuned transductively, this outperforms the current state-of-the-art on standard datasets such as Mini-ImageNet, Tiered-ImageNet, CIFAR-FS and FC-100 with the same hyper-parameters. The simplicity of this approach enables us to demonstrate the first few-shot learning results on the ImageNet-21k dataset. We find that using a large number of meta-training classes results in high few-shot accuracies even for a large number of few-shot classes. We do not advocate our approach as the solution for few-shot learning, but simply use the results to highlight limitations of current benchmarks and few-shot protocols. We perform extensive studies on benchmark datasets to propose a metric that quantifies the "hardness" of a few-shot episode. This metric can be used to report the performance of few-shot algorithms in a more systematic way.

Cite

Text

Dhillon et al. "A Baseline for Few-Shot Image Classification." International Conference on Learning Representations, 2020.

Markdown

[Dhillon et al. "A Baseline for Few-Shot Image Classification." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/dhillon2020iclr-baseline/)

BibTeX

@inproceedings{dhillon2020iclr-baseline,
  title     = {{A Baseline for Few-Shot Image Classification}},
  author    = {Dhillon, Guneet S. and Chaudhari, Pratik and Ravichandran, Avinash and Soatto, Stefano},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/dhillon2020iclr-baseline/}
}