Deep Learning for Confidence Information in Stereo and ToF Data Fusion
Abstract
This paper proposes a novel framework for the fusion of depth data produced by a Time-of-Flight (ToF) camera and a stereo vision system. The key problem of balancing between the two sources of information is solved by extracting confidence maps for both sources using deep learning. We introduce a novel synthetic dataset accurately representing the data acquired by the proposed setup and use it to train a Convolutional Neural Network architecture. The machine learning framework estimates the reliability of both data sources at each pixel location. The two depth fields are finally fused enforcing the local consistency of depth data taking into account the confidence information. Experimental results show that the proposed approach increases the accuracy of the depth estimation.
Cite
Text
Agresti et al. "Deep Learning for Confidence Information in Stereo and ToF Data Fusion." IEEE/CVF International Conference on Computer Vision Workshops, 2017. doi:10.1109/ICCVW.2017.88Markdown
[Agresti et al. "Deep Learning for Confidence Information in Stereo and ToF Data Fusion." IEEE/CVF International Conference on Computer Vision Workshops, 2017.](https://mlanthology.org/iccvw/2017/agresti2017iccvw-deep/) doi:10.1109/ICCVW.2017.88BibTeX
@inproceedings{agresti2017iccvw-deep,
title = {{Deep Learning for Confidence Information in Stereo and ToF Data Fusion}},
author = {Agresti, Gianluca and Minto, Ludovico and Marin, Giulio and Zanuttigh, Pietro},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2017},
pages = {697-705},
doi = {10.1109/ICCVW.2017.88},
url = {https://mlanthology.org/iccvw/2017/agresti2017iccvw-deep/}
}