Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis

Abstract

Any-scale image synthesis offers an efficient and scalable solution to synthesize photo-realistic images at any scale, even going beyond 2K resolution. However, existing GAN-based solutions depend excessively on convolutions and a hierarchical architecture, which introduce inconsistency and the "texture sticking" issue when scaling the output resolution. From another perspective, INR-based generators are scale-equivariant by design, but their huge memory footprint and slow inference hinder these networks from being adopted in large-scale or real-time systems. In this work, we propose Column-Row Entangled Pixel Synthesisthes (CREPS), a new generative model that is both efficient and scale-equivariant without using any spatial convolutions or coarse-to-fine design. To save memory footprint and make the system scalable, we employ a novel bi-line representation that decomposes layer-wise feature maps into separate "thick" column and row encodings. Experiments on standard datasets, including FFHQ, LSUN-Church, and MetFaces, confirm CREPS' ability to synthesize scale-consistent and alias-free images up to 4K resolution with proper training and inference speed.

Cite

Text

Nguyen et al. "Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02146

Markdown

[Nguyen et al. "Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/nguyen2023cvpr-efficient/) doi:10.1109/CVPR52729.2023.02146

BibTeX

@inproceedings{nguyen2023cvpr-efficient,
  title     = {{Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis}},
  author    = {Nguyen, Thuan Hoang and Van Le, Thanh and Tran, Anh},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {22408-22417},
  doi       = {10.1109/CVPR52729.2023.02146},
  url       = {https://mlanthology.org/cvpr/2023/nguyen2023cvpr-efficient/}
}