A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing
Abstract
In this work we describe a novel statistical video representation and modeling scheme. Video representation schemes are needed to enable segmenting a video stream into meaningful video-objects, useful for later indexing and retrieval applications. In the proposed methodology, unsupervised clustering via Guassian mixture modeling extracts coherent space-time regions in feature space, and corresponding coherent segments ( video-regions ) in the video content. A key feature of the system is the analysis of video input as a single entity as opposed to a sequence of separate frames. Space and time are treated uniformly. The extracted space-time regions allow for the detection and recognition of video events. Results of segmenting video content into static vs. dynamic video regions and video content editing are presented.
Cite
Text
Greenspan et al. "A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing." European Conference on Computer Vision, 2002. doi:10.1007/3-540-47979-1_31Markdown
[Greenspan et al. "A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing." European Conference on Computer Vision, 2002.](https://mlanthology.org/eccv/2002/greenspan2002eccv-probabilistic/) doi:10.1007/3-540-47979-1_31BibTeX
@inproceedings{greenspan2002eccv-probabilistic,
title = {{A Probabilistic Framework for Spatio-Temporal Video Representation & Indexing}},
author = {Greenspan, Hayit and Goldberger, Jacob and Mayer, Arnaldo},
booktitle = {European Conference on Computer Vision},
year = {2002},
pages = {461-475},
doi = {10.1007/3-540-47979-1_31},
url = {https://mlanthology.org/eccv/2002/greenspan2002eccv-probabilistic/}
}