LOLNerf: Learn from One Look

Abstract

We present a method for learning a generative 3D model based on neural radiance fields, trained solely from data with only single views of each object. While generating realistic images is no longer a difficult task, producing the corresponding 3D structure such that they can be rendered from different views is non-trivial. We show that, unlike existing methods, one does not need multi-view data to achieve this goal. Specifically, we show that by reconstructing many images aligned to an approximate canonical pose with a single network conditioned on a shared latent space, you can learn a space of radiance fields that models shape and appearance for a class of objects. We demonstrate this by training models to reconstruct object categories using datasets that contain only one view of each subject without depth or geometry information. Our experiments show that we achieve state-of-the-art results in novel view synthesis and high-quality results for monocular depth prediction.

Cite

Text

Rebain et al. "LOLNerf: Learn from One Look." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00161

Markdown

[Rebain et al. "LOLNerf: Learn from One Look." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/rebain2022cvpr-lolnerf/) doi:10.1109/CVPR52688.2022.00161

BibTeX

@inproceedings{rebain2022cvpr-lolnerf,
  title     = {{LOLNerf: Learn from One Look}},
  author    = {Rebain, Daniel and Matthews, Mark and Yi, Kwang Moo and Lagun, Dmitry and Tagliasacchi, Andrea},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {1558-1567},
  doi       = {10.1109/CVPR52688.2022.00161},
  url       = {https://mlanthology.org/cvpr/2022/rebain2022cvpr-lolnerf/}
}