A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing

Abstract

In this work we describe a novel statistical video representation and modeling scheme. Video representation schemes are needed to enable segmenting a video stream into meaningful video-objects, useful for later indexing and retrieval applications. In the proposed methodology, unsupervised clustering via Guassian mixture modeling extracts coherent space-time regions in feature space, and corresponding coherent segments ( video-regions ) in the video content. A key feature of the system is the analysis of video input as a single entity as opposed to a sequence of separate frames. Space and time are treated uniformly. The extracted space-time regions allow for the detection and recognition of video events. Results of segmenting video content into static vs. dynamic video regions and video content editing are presented.

Cite

Text

Greenspan et al. "A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing." European Conference on Computer Vision, 2002. doi:10.1007/3-540-47979-1_31

Markdown

[Greenspan et al. "A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing." European Conference on Computer Vision, 2002.](https://mlanthology.org/eccv/2002/greenspan2002eccv-probabilistic/) doi:10.1007/3-540-47979-1_31

BibTeX

@inproceedings{greenspan2002eccv-probabilistic,
  title     = {{A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing}},
  author    = {Greenspan, Hayit and Goldberger, Jacob and Mayer, Arnaldo},
  booktitle = {European Conference on Computer Vision},
  year      = {2002},
  pages     = {461-475},
  doi       = {10.1007/3-540-47979-1_31},
  url       = {https://mlanthology.org/eccv/2002/greenspan2002eccv-probabilistic/}
}