Semantic Pyramid for Image Generation

Shocher, Assaf; Gandelsman, Yossi; Mosseri, Inbar; Yarom, Michal; Irani, Michal; Freeman, William T.; Dekel, Tali

doi:10.1109/CVPR42600.2020.00748

Semantic Pyramid for Image Generation

Assaf Shocher, Yossi Gandelsman, Inbar Mosseri, Michal Yarom, Michal Irani, William T. Freeman, Tali Dekel

CVPR 2020

doi:10.1109/CVPR42600.2020.00748 /cvpr/2020/shocher2020cvpr-semantic/

Abstract

We present a novel GAN-based model that utilizes the space of deep features learned by a pre-trained classification model. Inspired by classical image pyramid representations, we construct our model as a Semantic Generation Pyramid -- a hierarchical framework which leverages the continuum of semantic information encapsulated in such deep features; this ranges from low level information contained in fine features to high level, semantic information contained in deeper features. More specifically, given a set of features extracted from a reference image, our model generates diverse image samples, each with matching features at each semantic level of the classification model. We demonstrate that our model results in a versatile and flexible framework that can be used in various classic and novel image generation tasks. These include: generating images with a controllable extent of semantic similarity to a reference image, and different manipulation tasks such as semantically-controlled inpainting and compositing; all achieved with the same model, with no further training.

PDF CVPR Semantic Scholar

Cite

Text

Shocher et al. "Semantic Pyramid for Image Generation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00748

Markdown

[Shocher et al. "Semantic Pyramid for Image Generation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/shocher2020cvpr-semantic/) doi:10.1109/CVPR42600.2020.00748

BibTeX

@inproceedings{shocher2020cvpr-semantic,
  title     = {{Semantic Pyramid for Image Generation}},
  author    = {Shocher, Assaf and Gandelsman, Yossi and Mosseri, Inbar and Yarom, Michal and Irani, Michal and Freeman, William T. and Dekel, Tali},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00748},
  url       = {https://mlanthology.org/cvpr/2020/shocher2020cvpr-semantic/}
}