On Self-Supervised Image Representations for GAN Evaluation

Abstract

Embeddings from CNNs pretrained on ImageNet classification are the de facto standard image representations for assessing GANs via the FID, Precision, and Recall measures. Despite broad prior criticism of their use for non-ImageNet domains, these embeddings remain the top choice in most of the GAN literature. In this paper, we advocate using state-of-the-art self-supervised representations to evaluate GANs on established non-ImageNet benchmarks. These representations, typically obtained via contrastive learning, have been shown to transfer better to new tasks and domains, and can therefore serve as more universal embeddings of natural images. Through an extensive comparison of recent GANs on common datasets, we show that self-supervised representations produce a more reasonable ranking of models in terms of FID/Precision/Recall, while the ranking obtained with classification-pretrained embeddings can often be misleading.
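
For readers unfamiliar with FID, the following is a minimal sketch of the standard Fréchet distance between Gaussians fitted to two sets of embeddings. It is not taken from the paper. The function name `frechet_distance` and the random arrays standing in for embeddings are assumptions made for illustration only. In practice the features would come from an encoder, whether classification-pretrained or self-supervised, applied to real and generated images.

```python
import numpy as np


def frechet_distance(feats_real, feats_fake):
    """Frechet distance between Gaussians fitted to two (n, d) feature arrays, d >= 2.

    Hypothetical helper for illustration; the encoder that produces the
    features is outside the scope of this sketch.
    """
    mu_r = feats_real.mean(axis=0)
    mu_f = feats_fake.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_f = np.cov(feats_fake, rowvar=False)

    # Tr((cov_r cov_f)^{1/2}) equals the sum of square roots of the
    # eigenvalues of cov_r @ cov_f, which are real and non-negative for
    # positive semi-definite inputs (clipping guards against round-off).
    eigvals = np.linalg.eigvals(cov_r @ cov_f)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_r - mu_f
    return float(diff @ diff + np.trace(cov_r) + np.trace(cov_f) - 2.0 * trace_sqrt)


# Placeholder "embeddings": random vectors, not outputs of a real encoder.
rng = np.random.default_rng(0)
real = rng.normal(size=(500, 4))
fake = real + 2.0  # same covariance, mean shifted by 2 in every dimension
print(frechet_distance(real, real))
print(frechet_distance(real, fake))
```

Swapping the encoder changes only how `feats_real` and `feats_fake` are produced; the distance computation itself stays the same, which is why the choice of representation can alter model rankings.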

Cite

Text

Morozov et al. "On Self-Supervised Image Representations for GAN Evaluation." International Conference on Learning Representations, 2021.

Markdown

[Morozov et al. "On Self-Supervised Image Representations for GAN Evaluation." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/morozov2021iclr-selfsupervised/)

BibTeX

@inproceedings{morozov2021iclr-selfsupervised,
  title     = {{On Self-Supervised Image Representations for GAN Evaluation}},
  author    = {Morozov, Stanislav and Voynov, Andrey and Babenko, Artem},
  booktitle = {International Conference on Learning Representations},
  year      = {2021},
  url       = {https://mlanthology.org/iclr/2021/morozov2021iclr-selfsupervised/}
}