MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement

Abstract

Existing works in single-image human reconstruction suffer from weak generalizability due to insufficient training data or 3D inconsistencies for a lack of comprehensive multi-view knowledge. In this paper, we introduce MagicMan, a human-specific multi-view diffusion model to generate high-quality novel views from a single reference image. As its core, we leverage a pre-trained 2D diffusion model as the generative prior for generalizability, with the parametric SMPL-X model as the 3D body prior to promote 3D awareness. To maintain consistency while generating denser views for improved 3D human reconstruction, we introduce hybrid multi-view attention to facilitate efficient and thorough information interchange across views. Besides, we present a geometry-aware dual branch to perform concurrent generation in both RGB and normal domains, further enhancing consistency via geometry cues. Last but not least, to address ill-shaped issues arising from inaccurate SMPL-X estimation, we propose a novel iterative refinement strategy, which progressively optimizes SMPL-X accuracy while enhancing the quality and consistency of the generated multi-views. Extensive experimental results demonstrate that our method significantly outperforms existing approaches in both novel view synthesis and subsequent 3D human reconstruction tasks.

Cite

Text

He et al. "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I3.32356

Markdown

[He et al. "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/he2025aaai-magicman/) doi:10.1609/AAAI.V39I3.32356

BibTeX

@inproceedings{he2025aaai-magicman,
  title     = {{MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement}},
  author    = {He, Xu and Wu, Zhiyong and Li, Xiaoyu and Kang, Di and Zhang, Chaopeng and Ye, Jiangnan and Chen, Liyang and Gao, Xiangjun and Zhang, Han and Zhuang, Haolin},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {3437-3445},
  doi       = {10.1609/AAAI.V39I3.32356},
  url       = {https://mlanthology.org/aaai/2025/he2025aaai-magicman/}
}