Understanding and Simplifying Perceptual Distances

Abstract

Perceptual metrics based on features of deep Convolutional Neural Networks (CNNs) have shown remarkable success when used as loss functions in a range of computer vision problems and significantly outperform classical losses such as L1 or L2 in pixel space. The source of this success remains somewhat mysterious, especially since a good loss does not require a particular CNN architecture nor a particular training method. In this paper we show that similar success can be achieved even with losses based on features of a deep CNN with random filters. We use the tool of infinite CNNs to derive an analytical form for perceptual similarity in such CNNs, and prove that the perceptual distance between two images is equivalent to the maximum mean discrepancy (MMD) distance between local distributions of small patches in the two images. We use this equivalence to propose a simple metric for comparing two images which directly computes the MMD between local distributions of patches in the two images. Our proposed metric is simple to understand, requires no deep networks, and gives comparable performance to perceptual metrics in a range of computer vision tasks.

Cite

Text

Amir and Weiss. "Understanding and Simplifying Perceptual Distances." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01205

Markdown

[Amir and Weiss. "Understanding and Simplifying Perceptual Distances." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/amir2021cvpr-understanding/) doi:10.1109/CVPR46437.2021.01205

BibTeX

@inproceedings{amir2021cvpr-understanding,
  title     = {{Understanding and Simplifying Perceptual Distances}},
  author    = {Amir, Dan and Weiss, Yair},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {12226-12235},
  doi       = {10.1109/CVPR46437.2021.01205},
  url       = {https://mlanthology.org/cvpr/2021/amir2021cvpr-understanding/}
}