Adversarial Generation of Continuous Images
Abstract
In most existing learning systems, images are typically viewed as 2D pixel arrays. However, in another paradigm gaining popularity, a 2D image is represented as an implicit neural representation (INR) -- an MLP that predicts an RGB pixel value given its (x,y) coordinate. In this paper, we propose two novel architectural techniques for building INR-based image decoders: factorized multiplicative modulation and multi-scale INRs, and use them to build a state-of-the-art continuous image GAN. Previous attempts to adapt INRs for image generation were limited to MNIST-like datasets and do not scale to complex real-world data. Our proposed INR-GAN architecture improves the performance of continuous image generators by several times, greatly reducing the gap between continuous image GANs and pixel-based ones. Apart from that, we explore several exciting properties of the INR-based decoders, like out-of-the-box superresolution, meaningful image-space interpolation, accelerated inference of low-resolution images, an ability to extrapolate outside of image boundaries, and strong geometric prior. The project page is located at https://universome.github.io/inr-gan.
Cite
Text
Skorokhodov et al. "Adversarial Generation of Continuous Images." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01061Markdown
[Skorokhodov et al. "Adversarial Generation of Continuous Images." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/skorokhodov2021cvpr-adversarial/) doi:10.1109/CVPR46437.2021.01061BibTeX
@inproceedings{skorokhodov2021cvpr-adversarial,
title = {{Adversarial Generation of Continuous Images}},
author = {Skorokhodov, Ivan and Ignatyev, Savva and Elhoseiny, Mohamed},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {10753-10764},
doi = {10.1109/CVPR46437.2021.01061},
url = {https://mlanthology.org/cvpr/2021/skorokhodov2021cvpr-adversarial/}
}