Stereo Matching with Nonparametric Smoothness Priors in Feature Space
Abstract
We propose a novel formulation of stereo matching that considers each pixel as a feature vector. Under this view, matching two or more images can be cast as matching point clouds in feature space. We build a nonparametric depth smoothness model in this space that correlates the image features and depth values. This model induces a sparse graph that links pixels with similar features, thereby converting each point cloud into a connected network. This network defines a neighborhood system that captures pixel grouping hierarchies without resorting to image segmentation. We formulate global stereo matching over this neighborhood system and use graph cuts to match pixels between two or more such networks. We show that our stereo formulation is able to recover surfaces with different orders of smoothness, such as those with high-curvature details and sharp discontinuities. Furthermore, compared to other single-frame stereo methods, our method produces more temporally stable results from videos of dynamic scenes, even when applied to each frame independently.
Cite
Text
Smith et al. "Stereo Matching with Nonparametric Smoothness Priors in Feature Space." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206793Markdown
[Smith et al. "Stereo Matching with Nonparametric Smoothness Priors in Feature Space." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/smith2009cvpr-stereo/) doi:10.1109/CVPR.2009.5206793BibTeX
@inproceedings{smith2009cvpr-stereo,
title = {{Stereo Matching with Nonparametric Smoothness Priors in Feature Space}},
author = {Smith, Brandon M. and Zhang, Li and Jin, Hailin},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2009},
pages = {485-492},
doi = {10.1109/CVPR.2009.5206793},
url = {https://mlanthology.org/cvpr/2009/smith2009cvpr-stereo/}
}