MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Abstract

With advances in deep learning models and the availability of large-scale 3D datasets, we have recently witnessed significant progress in single-view 3D reconstruction. However, existing methods often fail to reconstruct physically based material properties given a single image, limiting their applicability in complicated scenarios. This paper presents a novel approach (named MAGE) for generating 3D geometry with realistic decomposed material properties given a single image as input. Our method leverages inspiration from traditional computer graphics deferred rendering pipelines to introduce a multi-view G-buffer estimation model. The proposed model estimates G-buffers for various views as multi-domain images, including XYZ coordinates, normals, albedo, roughness, and metallic properties from a single-view RGB image. To address the inherent ambiguity and inconsistency in generating G-buffers simultaneously, we also formulate a deterministic network from the pretrained diffusion models and propose a lighting response loss that enforces consistency across these domains using PBR principles. Finally, we propose a large-scale synthetic dataset rich in material diversity for our model training. Experimental results demonstrate the effectiveness of our method in producing high-quality 3D meshes with rich material properties. Our code and dataset can be found at https://www.whyy.site/paper/mage.

Cite

Text

Wang et al. "MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01026

Markdown

[Wang et al. "MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/wang2025cvpr-mage/) doi:10.1109/CVPR52734.2025.01026

BibTeX

@inproceedings{wang2025cvpr-mage,
  title     = {{MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model}},
  author    = {Wang, Haoyuan and Wang, Zhenwei and Long, Xiaoxiao and Lin, Cheng and Hancke, Gerhard and Lau, Rynson W.H.},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {10985-10995},
  doi       = {10.1109/CVPR52734.2025.01026},
  url       = {https://mlanthology.org/cvpr/2025/wang2025cvpr-mage/}
}