G-NeRF: Geometry-Enhanced Novel View Synthesis from Single-View Images
Abstract
Novel view synthesis aims to generate new view images of a given view image collection. Recent attempts address this problem relying on 3D geometry priors (e.g. shapes sizes and positions) learned from multi-view images. However such methods encounter the following limitations: 1) they require a set of multi-view images as training data for a specific scene (e.g. face car or chair) which is often unavailable in many real-world scenarios; 2) they fail to extract the geometry priors from single-view images due to the lack of multi-view supervision. In this paper we propose a Geometry-enhanced NeRF (G-NeRF) which seeks to enhance the geometry priors by a geometry-guided multi-view synthesis approach followed by a depth-aware training. In the synthesis process inspired that existing 3D GAN models can unconditionally synthesize high-fidelity multi-view images we seek to adopt off-the-shelf 3D GAN models such as EG3D as a free source to provide geometry priors through synthesizing multi-view data. Simultaneously to further improve the geometry quality of the synthetic data we introduce a truncation method to effectively sample latent codes within 3D GAN models. To tackle the absence of multi-view supervision for single-view images we design the depth-aware training approach incorporating a depth-aware discriminator to guide geometry priors through depth maps. Experiments demonstrate the effectiveness of our method in terms of both qualitative and quantitative results.
Cite
Text
Huang et al. "G-NeRF: Geometry-Enhanced Novel View Synthesis from Single-View Images." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00964Markdown
[Huang et al. "G-NeRF: Geometry-Enhanced Novel View Synthesis from Single-View Images." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/huang2024cvpr-gnerf/) doi:10.1109/CVPR52733.2024.00964BibTeX
@inproceedings{huang2024cvpr-gnerf,
title = {{G-NeRF: Geometry-Enhanced Novel View Synthesis from Single-View Images}},
author = {Huang, Zixiong and Chen, Qi and Sun, Libo and Yang, Yifan and Wang, Naizhou and Wu, Qi and Tan, Mingkui},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {10117-10126},
doi = {10.1109/CVPR52733.2024.00964},
url = {https://mlanthology.org/cvpr/2024/huang2024cvpr-gnerf/}
}