Space-Time Interest Points
Abstract
Local image features or interest points provide compact and abstract representations of patterns in an image. We propose to extend the notion of spatial interest points into the spatio-temporal domain and show how the resulting features often reflect interesting events that can be used for a compact representation of video data as well as for its interpretation. To detect spatio-temporal events, we build on the idea of the Harris and Forstner interest point operators and detect local structures in space-time where the image values have significant local variations in both space and time. We then estimate the spatio-temporal extents of the detected events and compute their scale-invariant spatio-temporal descriptors. Using such descriptors, we classify events and construct video representation in terms of labeled space-time points. For the problem of human motion analysis, we illustrate how the proposed method allows for detection of walking people in scenes with occlusions and dynamic backgrounds.
Cite
Text
Laptev and Lindeberg. "Space-Time Interest Points." IEEE/CVF International Conference on Computer Vision, 2003. doi:10.1109/ICCV.2003.1238378Markdown
[Laptev and Lindeberg. "Space-Time Interest Points." IEEE/CVF International Conference on Computer Vision, 2003.](https://mlanthology.org/iccv/2003/laptev2003iccv-space/) doi:10.1109/ICCV.2003.1238378BibTeX
@inproceedings{laptev2003iccv-space,
title = {{Space-Time Interest Points}},
author = {Laptev, Ivan and Lindeberg, Tony},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2003},
pages = {432-439},
doi = {10.1109/ICCV.2003.1238378},
url = {https://mlanthology.org/iccv/2003/laptev2003iccv-space/}
}