Semi-Automatic Stereo Extraction from Video Footage
Abstract
We present a semi-automatic system that converts conventional video shots to stereoscopic video pairs. The system requires just a few user scribbles in a sparse set of frames. It combines a diffusion scheme, which takes into account the local saliency and the local motion at each video location, with a classification scheme that assigns depth to image patches. The system tolerates both scene motion and camera motion. In typical shots containing hundreds of frames, even in the face of significant motion, it is enough to mark scribbles on the first and last frames of the shot. Once the scribbles are marked, plausible stereo results are obtained in a matter of seconds, leading to a scalable video conversion system. Finally, we validate our results with ground truth stereo video.
Cite
Text
Guttmann et al. "Semi-Automatic Stereo Extraction from Video Footage." IEEE/CVF International Conference on Computer Vision, 2009. doi:10.1109/ICCV.2009.5459158
Markdown
[Guttmann et al. "Semi-Automatic Stereo Extraction from Video Footage." IEEE/CVF International Conference on Computer Vision, 2009.](https://mlanthology.org/iccv/2009/guttmann2009iccv-semi/) doi:10.1109/ICCV.2009.5459158
BibTeX
@inproceedings{guttmann2009iccv-semi,
title = {{Semi-Automatic Stereo Extraction from Video Footage}},
author = {Guttmann, Moshe and Wolf, Lior and Cohen-Or, Daniel},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2009},
pages = {136--142},
doi = {10.1109/ICCV.2009.5459158},
url = {https://mlanthology.org/iccv/2009/guttmann2009iccv-semi/}
}