LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Abstract

3D content creation has achieved significant progress in terms of both quality and speed. Although current feed-forward models can produce 3D objects in seconds, their resolution is constrained by the intensive computation required during training. In this paper, we introduce Large Multi-View Gaussian Model (LGM), a novel framework designed to generate high-resolution 3D models from text prompts or single-view images. Our key insights are two-fold: 1) 3D Representation: We propose multi-view Gaussian features as an efficient yet powerful representation, which can then be fused together for differentiable rendering. 2) 3D Backbone: We present an asymmetric U-Net as a high-throughput backbone operating on multi-view images, which can be produced from text or single-view image input by leveraging multi-view diffusion models. Extensive experiments demonstrate the high fidelity and efficiency of our approach. Notably, we maintain the fast speed to generate 3D objects within 5 seconds while boosting the training resolution to 512, thereby achieving high-resolution 3D content generation. Our project page is available at https://me.kiui.moe/ lgm/.

Cite

Text

Tang et al. "LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73235-5_1

Markdown

[Tang et al. "LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/tang2024eccv-lgm/) doi:10.1007/978-3-031-73235-5_1

BibTeX

@inproceedings{tang2024eccv-lgm,
  title     = {{LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation}},
  author    = {Tang, Jiaxiang and Chen, Zhaoxi and Chen, Xiaokang and Wang, Tengfei and Zeng, Gang and Liu, Ziwei},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73235-5_1},
  url       = {https://mlanthology.org/eccv/2024/tang2024eccv-lgm/}
}