Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis
Abstract
Any-scale image synthesis offers an efficient and scalable solution to synthesize photo-realistic images at any scale, even going beyond 2K resolution. However, existing GAN-based solutions depend excessively on convolutions and a hierarchical architecture, which introduce inconsistency and the "texture sticking" issue when scaling the output resolution. From another perspective, INR-based generators are scale-equivariant by design, but their huge memory footprint and slow inference hinder these networks from being adopted in large-scale or real-time systems. In this work, we propose Column-Row Entangled Pixel Synthesisthes (CREPS), a new generative model that is both efficient and scale-equivariant without using any spatial convolutions or coarse-to-fine design. To save memory footprint and make the system scalable, we employ a novel bi-line representation that decomposes layer-wise feature maps into separate "thick" column and row encodings. Experiments on standard datasets, including FFHQ, LSUN-Church, and MetFaces, confirm CREPS' ability to synthesize scale-consistent and alias-free images up to 4K resolution with proper training and inference speed.
Cite
Text
Nguyen et al. "Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02146Markdown
[Nguyen et al. "Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/nguyen2023cvpr-efficient/) doi:10.1109/CVPR52729.2023.02146BibTeX
@inproceedings{nguyen2023cvpr-efficient,
title = {{Efficient Scale-Invariant Generator with Column-Row Entangled Pixel Synthesis}},
author = {Nguyen, Thuan Hoang and Van Le, Thanh and Tran, Anh},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {22408-22417},
doi = {10.1109/CVPR52729.2023.02146},
url = {https://mlanthology.org/cvpr/2023/nguyen2023cvpr-efficient/}
}