Extracting Spatiotemporal Interest Points Using Global Information

Abstract

Local spatiotemporal features or interest points provide compact but descriptive representations for efficient video analysis and motion recognition. Current local feature extraction approaches involve either local filtering or entropy computation which ignore global information (e.g. large blobs of moving pixels) in video inputs. This paper presents a novel extraction method which utilises global information from each video input so that moving parts such as a moving hand can be identified and are used to select relevant interest points for a condensed representation. The proposed method involves obtaining a small set of subspace images, which can synthesise frames in the video input from their corresponding coefficient vectors, and then detecting interest points from the subspaces and the coefficient vectors. Experimental results indicate that the proposed method can yield a sparser set of interest points for motion recognition than existing methods.

Cite

Text

Wong and Cipolla. "Extracting Spatiotemporal Interest Points Using Global Information." IEEE/CVF International Conference on Computer Vision, 2007. doi:10.1109/ICCV.2007.4408923

Markdown

[Wong and Cipolla. "Extracting Spatiotemporal Interest Points Using Global Information." IEEE/CVF International Conference on Computer Vision, 2007.](https://mlanthology.org/iccv/2007/wong2007iccv-extracting/) doi:10.1109/ICCV.2007.4408923

BibTeX

@inproceedings{wong2007iccv-extracting,
  title     = {{Extracting Spatiotemporal Interest Points Using Global Information}},
  author    = {Wong, Shu-Fai and Cipolla, Roberto},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2007},
  pages     = {1-8},
  doi       = {10.1109/ICCV.2007.4408923},
  url       = {https://mlanthology.org/iccv/2007/wong2007iccv-extracting/}
}