VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors

Abstract

We present a stereo-matching method for depth estimation from high-resolution images using visual hulls as priors and a memory-efficient technique for the correlation computation. Our method uses object masks extracted from supplementary views of the scene to guide the disparity estimation effectively reducing the search space for matches. This approach is specifically tailored to stereo rigs in volumetric capture systems where an accurate depth plays a key role in the downstream reconstruction task. To enable training and regression at high resolutions targeted by recent systems our approach extends a sparse correlation computation into a hybrid sparse-dense scheme suitable for application in leading recurrent network architectures. We evaluate the performance-efficiency trade-off of our method compared to state-of-the-art approaches and demonstrate the efficacy of the visual hull guidance. In addition we propose a training scheme for a further reduction of memory requirements during optimization facilitating training on high-resolution data.

Cite

Text

Plack et al. "VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors." Winter Conference on Applications of Computer Vision, 2025.

Markdown

[Plack et al. "VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors." Winter Conference on Applications of Computer Vision, 2025.](https://mlanthology.org/wacv/2025/plack2025wacv-vhs/)

BibTeX

@inproceedings{plack2025wacv-vhs,
  title     = {{VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors}},
  author    = {Plack, Markus and Dröge, Hannah and Van Holland, Leif and Hullin, Matthias B.},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2025},
  pages     = {505-514},
  url       = {https://mlanthology.org/wacv/2025/plack2025wacv-vhs/}
}