A Layered Approach to Stereo Reconstruction
Abstract
We propose a framework for extracting structure from stereo which represents the scene as a collection of approximately planar layers. Each layer consists of an explicit 3D plane equation, a colored image with per-pixel opacity (a sprite), and a per-pixel depth offset relative to the plane. Initial estimates of the layers are recovered using techniques taken from parametric motion estimation. These initial estimates are then refined using a re-synthesis algorithm which takes into account both occlusions and mixed pixels. Reasoning about such effects allows the recovery of depth and color information with high accuracy even in partially occluded regions. Another important benefit of our framework is that the output consists of a collection of approximately planar regions, a representation which is far more appropriate than a dense depth map for many applications such as rendering and video parsing.
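As an illustration of the layer representation described above, the following is a minimal sketch and not the authors' implementation. The `Layer` class, its field names, the plane parameterization, and the `composite` function are all hypothetical. It also assumes every sprite has already been warped onto a common pixel grid, so the plane-induced warp and the refinement step are omitted. It only shows how sprites with per-pixel opacity could be blended back to front with the standard "over" operator, which is how a mixed pixel can arise from two layers.

```python
from dataclasses import dataclass
from typing import List, Tuple

# (r, g, b, alpha) with non-premultiplied color; an assumption for this sketch.
Pixel = Tuple[float, float, float, float]
Color = Tuple[float, float, float]


@dataclass
class Layer:
    # Plane equation a*X + b*Y + c*Z + d = 0 (hypothetical parameterization).
    plane: Tuple[float, float, float, float]
    # Sprite: rows of RGBA pixels, i.e. color plus per-pixel opacity.
    sprite: List[List[Pixel]]
    # Per-pixel depth offset relative to the plane, same shape as the sprite.
    depth_offset: List[List[float]]


def composite(layers_back_to_front: List[Layer]) -> List[List[Color]]:
    """Blend pre-aligned sprites with the 'over' operator, farthest layer first."""
    height = len(layers_back_to_front[0].sprite)
    width = len(layers_back_to_front[0].sprite[0])
    # Start from a black canvas.
    out: List[List[Color]] = [[(0.0, 0.0, 0.0)] * width for _ in range(height)]
    for layer in layers_back_to_front:
        for y in range(height):
            for x in range(width):
                r, g, b, a = layer.sprite[y][x]
                pr, pg, pb = out[y][x]
                # The nearer layer covers what is behind it in proportion to alpha.
                out[y][x] = (
                    a * r + (1.0 - a) * pr,
                    a * g + (1.0 - a) * pg,
                    a * b + (1.0 - a) * pb,
                )
    return out
```

For example, a half-transparent blue pixel in a near layer over an opaque red pixel in a far layer yields an equal mix of the two colors, which is the kind of mixed pixel the abstract refers to.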
Cite
Text
Baker et al. "A Layered Approach to Stereo Reconstruction." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1998. doi:10.1109/CVPR.1998.698642
Markdown
[Baker et al. "A Layered Approach to Stereo Reconstruction." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1998.](https://mlanthology.org/cvpr/1998/baker1998cvpr-layered/) doi:10.1109/CVPR.1998.698642
BibTeX
@inproceedings{baker1998cvpr-layered,
title = {{A Layered Approach to Stereo Reconstruction}},
author = {Baker, Simon and Szeliski, Richard and Anandan, P.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {1998},
pages = {434-441},
doi = {10.1109/CVPR.1998.698642},
url = {https://mlanthology.org/cvpr/1998/baker1998cvpr-layered/}
}