Efficient Deep Learning for Stereo Matching
Abstract
In the past year, convolutional neural networks have been shown to perform extremely well for stereo estimation. However, current architectures rely on siamese networks which exploit concatenation followed by further processing layers, requiring a minute of GPU computation per image pair. In contrast, in this paper we propose a matching network which is able to produce very accurate results in less than a second of GPU computation. Towards this goal, we exploit a product layer which simply computes the inner product between the two representations of a siamese architecture. We train our network by treating the problem as multi-class classification, where the classes are all possible disparities. This allows us to get calibrated scores, which result in much better matching performance when compared to existing approaches.
Cite
Text
Luo et al. "Efficient Deep Learning for Stereo Matching." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.614Markdown
[Luo et al. "Efficient Deep Learning for Stereo Matching." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/luo2016cvpr-efficient/) doi:10.1109/CVPR.2016.614BibTeX
@inproceedings{luo2016cvpr-efficient,
title = {{Efficient Deep Learning for Stereo Matching}},
author = {Luo, Wenjie and Schwing, Alexander G. and Urtasun, Raquel},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2016},
doi = {10.1109/CVPR.2016.614},
url = {https://mlanthology.org/cvpr/2016/luo2016cvpr-efficient/}
}