Any-Resolution Training for High-Resolution Image Synthesis

Abstract

Generative models operate at fixed resolution, even though natural images come in a variety of sizes. As high-resolution details are downsampled away and low-resolution images are discarded altogether, precious supervision is lost. We argue that every pixel matters and create datasets with variable-size images, collected at their native resolutions. To take advantage of varied-size data, we introduce continuous-scale training, a process that samples patches at random scales to train a new generator with variable output resolutions. First, conditioning the generator on a target scale allows us to generate higher resolution images than previously possible, without adding layers to the model. Second, by conditioning on continuous coordinates, we can sample patches that still obey a consistent global layout, which also allows for scalable training at higher resolutions. Controlled FFHQ experiments show that our method can take advantage of multi-resolution training data better than discrete multi-scale approaches, achieving better FID scores and cleaner high-frequency details. We also train on other natural image domains including churches, mountains, and birds, and demonstrate arbitrary scale synthesis with both coherent global layouts and realistic local details, going beyond 2K resolution in our experiments. Our project page is available at: https://chail.github.io/anyres-gan/.

Cite

Text

Chai et al. "Any-Resolution Training for High-Resolution Image Synthesis." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19787-1_10

Markdown

[Chai et al. "Any-Resolution Training for High-Resolution Image Synthesis." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/chai2022eccv-anyresolution/) doi:10.1007/978-3-031-19787-1_10

BibTeX

@inproceedings{chai2022eccv-anyresolution,
  title     = {{Any-Resolution Training for High-Resolution Image Synthesis}},
  author    = {Chai, Lucy and Gharbi, Michaël and Shechtman, Eli and Isola, Phillip and Zhang, Richard},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19787-1_10},
  url       = {https://mlanthology.org/eccv/2022/chai2022eccv-anyresolution/}
}