PlaneFormers: From Sparse View Planes to 3D Reconstruction

Abstract

We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success.

Cite

Text

Agarwala et al. "PlaneFormers: From Sparse View Planes to 3D Reconstruction." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20062-5_12

Markdown

[Agarwala et al. "PlaneFormers: From Sparse View Planes to 3D Reconstruction." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/agarwala2022eccv-planeformers/) doi:10.1007/978-3-031-20062-5_12

BibTeX

@inproceedings{agarwala2022eccv-planeformers,
  title     = {{PlaneFormers: From Sparse View Planes to 3D Reconstruction}},
  author    = {Agarwala, Samir and Jin, Linyi and Rockwell, Chris and Fouhey, David F.},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-20062-5_12},
  url       = {https://mlanthology.org/eccv/2022/agarwala2022eccv-planeformers/}
}