GenRC: Generative 3D Room Completion from Sparse Image Collections
Abstract
Sparse RGBD scene completion is a challenging task especially when considering consistent textures and geometries throughout the entire scene. Different from existing solutions that rely on human-designed text prompts or predefined camera trajectories, we propose , an automated training-free pipeline to complete a room-scale 3D mesh with high-fidelity textures. To achieve this, we first project the sparse RGBD images to a highly incomplete 3D mesh. Instead of iteratively generating novel views to fill in the void, we utilized our proposed E-Diffusion to generate a view-consistent panoramic RGBD image which ensures global geometry and appearance consistency. Furthermore, we maintain the input-output scene stylistic consistency through textual inversion to replace human-designed text prompts. To bridge the domain gap among datasets, E-Diffusion leverages models trained on large-scale datasets to generate diverse appearances. outperforms state-of-the-art methods under most appearance and geometric metrics on ScanNet and ARKitScenes datasets, even though is not trained on these datasets nor using predefined camera trajectories. Project page: https://minfenli.github. io/GenRC/ Diffusion models
Cite
Text
Li et al. "GenRC: Generative 3D Room Completion from Sparse Image Collections." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72913-3_9Markdown
[Li et al. "GenRC: Generative 3D Room Completion from Sparse Image Collections." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/li2024eccv-genrc/) doi:10.1007/978-3-031-72913-3_9BibTeX
@inproceedings{li2024eccv-genrc,
title = {{GenRC: Generative 3D Room Completion from Sparse Image Collections}},
author = {Li, Ming-Feng and Ku, Yueh-Feng and Yen, Hong-Xuan and Liu, Chi and Liu, Yu-Lun and Chen, Albert Y and Kuo, Cheng-Hao and Sun, Min},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72913-3_9},
url = {https://mlanthology.org/eccv/2024/li2024eccv-genrc/}
}