Semi-Automatic Stereo Extraction from Video Footage

Abstract

We present a semi-automatic system that converts conventional video shots to stereoscopic video pairs. The system requires just a few user-scribbles in a sparse set of frames. The system combines a diffusion scheme, which takes into account the local saliency and the local motion at each video location, coupled with a classification scheme that assigns depth to image patches. The system tolerates both scene motion and camera motion. In typical shots, containing hundreds of frames, even in the face of significant motion, it is enough to mark scribbles on the first and last frames of the shot. Once marked, plausible stereo results are obtained in a matter of seconds, leading to a scalable video conversion system. Finally, we validate our results with ground truth stereo video.

Cite

Text

Guttmann et al. "Semi-Automatic Stereo Extraction from Video Footage." IEEE/CVF International Conference on Computer Vision, 2009. doi:10.1109/ICCV.2009.5459158

Markdown

[Guttmann et al. "Semi-Automatic Stereo Extraction from Video Footage." IEEE/CVF International Conference on Computer Vision, 2009.](https://mlanthology.org/iccv/2009/guttmann2009iccv-semi/) doi:10.1109/ICCV.2009.5459158

BibTeX

@inproceedings{guttmann2009iccv-semi,
  title     = {{Semi-Automatic Stereo Extraction from Video Footage}},
  author    = {Guttmann, Moshe and Wolf, Lior and Cohen-Or, Daniel},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2009},
  pages     = {136-142},
  doi       = {10.1109/ICCV.2009.5459158},
  url       = {https://mlanthology.org/iccv/2009/guttmann2009iccv-semi/}
}