Locally Consistent ToF and Stereo Data Fusion

Abstract

Depth estimation for dynamic scenes is a challenging and relevant problem in computer vision. Although this problem can be tackled by means of ToF cameras or stereo vision systems, each of the two systems alone has its own limitations. In this paper a framework for the fusion of 3D data produced by a ToF camera and a stereo vision system is proposed. Initially, depth data acquired by the ToF camera are up-sampled to the spatial resolution of the stereo vision images by a novel up-sampling algorithm based on image segmentation and bilateral filtering. In parallel a dense disparity field is obtained by a stereo vision algorithm. Finally, the up-sampled ToF depth data and the disparity field provided by stereo vision are synergically fused by enforcing the local consistency of depth data. The depth information obtained with the proposed framework is characterized by the high resolution of the stereo vision system and by an improved accuracy with respect to the one produced by both subsystems. Experimental results clearly show how the proposed method is able to outperform the compared fusion algorithms.

Cite

Text

Mutto et al. "Locally Consistent ToF and Stereo Data Fusion." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33863-2_62

Markdown

[Mutto et al. "Locally Consistent ToF and Stereo Data Fusion." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/mutto2012eccv-locally/) doi:10.1007/978-3-642-33863-2_62

BibTeX

@inproceedings{mutto2012eccv-locally,
  title     = {{Locally Consistent ToF and Stereo Data Fusion}},
  author    = {Mutto, Carlo Dal and Zanuttigh, Pietro and Mattoccia, Stefano and Cortelazzo, Guido Maria},
  booktitle = {European Conference on Computer Vision},
  year      = {2012},
  pages     = {598-607},
  doi       = {10.1007/978-3-642-33863-2_62},
  url       = {https://mlanthology.org/eccv/2012/mutto2012eccv-locally/}
}