Extracting Spatiotemporal Interest Points Using Global Information
Abstract
Local spatiotemporal features or interest points provide compact but descriptive representations for efficient video analysis and motion recognition. Current local feature extraction approaches involve either local filtering or entropy computation which ignore global information (e.g. large blobs of moving pixels) in video inputs. This paper presents a novel extraction method which utilises global information from each video input so that moving parts such as a moving hand can be identified and are used to select relevant interest points for a condensed representation. The proposed method involves obtaining a small set of subspace images, which can synthesise frames in the video input from their corresponding coefficient vectors, and then detecting interest points from the subspaces and the coefficient vectors. Experimental results indicate that the proposed method can yield a sparser set of interest points for motion recognition than existing methods.
Cite
Text
Wong and Cipolla. "Extracting Spatiotemporal Interest Points Using Global Information." IEEE/CVF International Conference on Computer Vision, 2007. doi:10.1109/ICCV.2007.4408923Markdown
[Wong and Cipolla. "Extracting Spatiotemporal Interest Points Using Global Information." IEEE/CVF International Conference on Computer Vision, 2007.](https://mlanthology.org/iccv/2007/wong2007iccv-extracting/) doi:10.1109/ICCV.2007.4408923BibTeX
@inproceedings{wong2007iccv-extracting,
title = {{Extracting Spatiotemporal Interest Points Using Global Information}},
author = {Wong, Shu-Fai and Cipolla, Roberto},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2007},
pages = {1-8},
doi = {10.1109/ICCV.2007.4408923},
url = {https://mlanthology.org/iccv/2007/wong2007iccv-extracting/}
}