HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors
Abstract
Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications in broader scenarios. To tackle these issues, we present HumanSplat, which predicts the 3D Gaussian Splatting properties of any human from a single input image in a generalizable manner.Specifically, HumanSplat comprises a 2D multi-view diffusion model and a latent reconstruction Transformer with human structure priors that adeptly integrate geometric priors and semantic features within a unified framework. A hierarchical loss that incorporates human semantic information is devised to achieve high-fidelity texture modeling and impose stronger constraints on the estimated multiple views. Comprehensive experiments on standard benchmarks and in-the-wild images demonstrate that HumanSplat surpasses existing state-of-the-art methods in achieving photorealistic novel-view synthesis. Project page: https://humansplat.github.io.
Cite
Text
Pan et al. "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors." Neural Information Processing Systems, 2024. doi:10.52202/079017-2367Markdown
[Pan et al. "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/pan2024neurips-humansplat/) doi:10.52202/079017-2367BibTeX
@inproceedings{pan2024neurips-humansplat,
title = {{HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors}},
author = {Pan, Panwang and Su, Zhuo and Lin, Chenguo and Fan, Zhen and Zhang, Yongjie and Li, Zeming and Shen, Tingting and Mu, Yadong and Liu, Yebin},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-2367},
url = {https://mlanthology.org/neurips/2024/pan2024neurips-humansplat/}
}