Video Object Proposals

Abstract

In this paper, we extend a recently proposed method for generic object detection in images, category-independent object proposals, to the case of video. Given a video, the output of our algorithm is a set of video segments that are likely to contain an object. This can be useful, e.g., as a first step in a video object detection system. Given the sheer amount of pixels in a video, a straightforward extension of the 2D methods to a 3D (spatiotemporal) volume is not feasible. Instead, we start by extracting object proposals in each frame separately. These are linked across frames into object hypotheses, which are then used as higher-order potentials in a graph-based video segmentation framework. Running multiple segmentations and ranking the segments based on the likelihood that they correspond to an object, yields our final set of video object proposals. © 2012 IEEE.

Cite

Text

Sharir and Tuytelaars. "Video Object Proposals." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2012. doi:10.1109/CVPRW.2012.6239254

Markdown

[Sharir and Tuytelaars. "Video Object Proposals." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2012.](https://mlanthology.org/cvprw/2012/sharir2012cvprw-video/) doi:10.1109/CVPRW.2012.6239254

BibTeX

@inproceedings{sharir2012cvprw-video,
  title     = {{Video Object Proposals}},
  author    = {Sharir, Gilad and Tuytelaars, Tinne},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2012},
  pages     = {9-14},
  doi       = {10.1109/CVPRW.2012.6239254},
  url       = {https://mlanthology.org/cvprw/2012/sharir2012cvprw-video/}
}