DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras
Abstract
We propose DiffuStereo, a novel system using only sparse cameras (8 in this work) for high-quality 3D human reconstruction. At its core is a novel diffusion-based stereo module, which introduces diffusion models, a type of powerful generative models, into the iterative stereo matching network. To this end, we design a new diffusion kernel and additional stereo constraints to facilitate stereo matching and depth estimation in the network. We further present a multi-level stereo network architecture to handle high-resolution (up to 4k) inputs without requiring unaffordable memory footprint. Given a set of sparse-view color images of a human, the proposed multi-level diffusion-based stereo network can produce highly accurate depth maps, which are then converted into a high-quality 3D human model through an efficient multi-view fusion strategy. Overall, our method enables automatic reconstruction of human models with quality on par to high-end dense-view camera rigs, and this is achieved using a much more light-weight hardware setup. Experiments show that our method outperforms state-of-the-art methods by a large margin both qualitatively and quantitatively.
Cite
Text
Shao et al. "DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19824-3_41Markdown
[Shao et al. "DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/shao2022eccv-diffustereo/) doi:10.1007/978-3-031-19824-3_41BibTeX
@inproceedings{shao2022eccv-diffustereo,
title = {{DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras}},
author = {Shao, Ruizhi and Zheng, Zerong and Zhang, Hongwen and Sun, Jingxiang and Liu, Yebin},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19824-3_41},
url = {https://mlanthology.org/eccv/2022/shao2022eccv-diffustereo/}
}