UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks

Abstract

Ultra-high-resolution image generation poses great challenges, such as increased semantic planning complexity and detail synthesis difficulties, alongside substantial training resource demands. We present UltraPixel, a novel architecture utilizing cascade diffusion models to generate high-quality images at multiple resolutions (\textit{e.g.}, 1K, 2K, and 4K) within a single model, while maintaining computational efficiency. UltraPixel leverages semantics-rich representations of lower-resolution images in a later denoising stage to guide the whole generation of highly detailed high-resolution images, significantly reducing complexity. Specifically, we introduce implicit neural representations for continuous upsampling and scale-aware normalization layers adaptable to various resolutions. Notably, both low- and high-resolution processes are performed in the most compact space, sharing the majority of parameters with less than 3$\%$ additional parameters for high-resolution outputs, largely enhancing training and inference efficiency. Our model achieves fast training with reduced data requirements, producing photo-realistic high-resolution images and demonstrating state-of-the-art performance in extensive experiments.

Cite

Text

Ren et al. "UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks." Neural Information Processing Systems, 2024. doi:10.52202/079017-3529

Markdown

[Ren et al. "UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/ren2024neurips-ultrapixel/) doi:10.52202/079017-3529

BibTeX

@inproceedings{ren2024neurips-ultrapixel,
  title     = {{UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks}},
  author    = {Ren, Jingjing and Li, Wenbo and Chen, Haoyu and Pei, Renjing and Shao, Bin and Guo, Yong and Peng, Long and Song, Fenglong and Zhu, Lei},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3529},
  url       = {https://mlanthology.org/neurips/2024/ren2024neurips-ultrapixel/}
}