Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

Abstract

Neural Radiance Fields (NeRF) coupled with GANs represent a promising direction in the area of 3D reconstruction from a single view, owing to their ability to efficiently model arbitrary topologies. Recent work in this area, however, has mostly focused on synthetic datasets where exact ground-truth poses are known, and has overlooked pose estimation, which is important for certain downstream applications such as augmented reality (AR) and robotics. We introduce a principled end-to-end reconstruction framework for natural images, where accurate ground-truth poses are not available. Our approach recovers an SDF-parameterized 3D shape, pose, and appearance from a single image of an object, without exploiting multiple views during training. More specifically, we leverage an unconditional 3D-aware generator, to which we apply a hybrid inversion scheme where a model produces a first guess of the solution which is then refined via optimization. Our framework can de-render an image in as few as 10 steps, enabling its use in practical scenarios. We demonstrate state-of-the-art results on a variety of real and synthetic benchmarks.

Cite

Text

Pavllo et al. "Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00427

Markdown

[Pavllo et al. "Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/pavllo2023cvpr-shape/) doi:10.1109/CVPR52729.2023.00427

BibTeX

@inproceedings{pavllo2023cvpr-shape,
  title     = {{Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion}},
  author    = {Pavllo, Dario and Tan, David Joseph and Rakotosaona, Marie-Julie and Tombari, Federico},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {4391-4401},
  doi       = {10.1109/CVPR52729.2023.00427},
  url       = {https://mlanthology.org/cvpr/2023/pavllo2023cvpr-shape/}
}