Spherical Manifold Guided Diffusion Model for Panoramic Image Generation

Abstract

Panoramic image essentially acts as a pivotal role in emerging virtual reality and augmented reality scenarios; however, the generation of panoramic images are essentially challenging due to the intrinsic spherical geometry and spherical distortions caused by equirectangular projection (ERP). To address this, we start from the very basics of S^2 manifold inherent to panoramic images, and propose a novel spherical manifold convolution (SMConv) on S^2 manifold. Based on the SMConv operation, we propose a spherical manifold guided diffusion (SMGD) model for text-conditioned panoramic image generation, which can well accommodate the spherical geometry during generation. We further develop a novel evaluation method by calculating grouped Frechet inception distance (FID) on cube-map projections, which can well reflect the quality of generated panoramic images, compared to existing methods that randomly crop ERP-distorted content. Experiment results demonstrate that our SMGD model achieves the state-of-the-art generation quality and accuracy, whilst retaining the shortest sampling time in the text-conditioned panoramic image generation task. Codes are publicly available at https://github.com/chronos123/SMGD.

Cite

Text

Sun et al. "Spherical Manifold Guided Diffusion Model for Panoramic Image Generation." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.00547

Markdown

[Sun et al. "Spherical Manifold Guided Diffusion Model for Panoramic Image Generation." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/sun2025cvpr-spherical/) doi:10.1109/CVPR52734.2025.00547

BibTeX

@inproceedings{sun2025cvpr-spherical,
  title     = {{Spherical Manifold Guided Diffusion Model for Panoramic Image Generation}},
  author    = {Sun, Xiancheng and Xu, Mai and Li, Shengxi and Ma, Senmao and Deng, Xin and Jiang, Lai and Shen, Gang},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {5824-5834},
  doi       = {10.1109/CVPR52734.2025.00547},
  url       = {https://mlanthology.org/cvpr/2025/sun2025cvpr-spherical/}
}