A Spatio-Temporal Feature Based on Triangulation of Dense SURF

Abstract

In this paper, we propose a spatio-temporal feature which is based on the appearance and movement of interest SURF key points. Given a video, we extract its spatiotemporal features according to every small set of frames. For each frame set, we first extract dense SURF key points from its first frame and estimate their optical flows at each frame. We then detect camera motion and compensate flow vectors in case camera motion exists. Next, we select interest points based on their movement based relationship through the frame set. We then apply Delaunay triangulation to form triangles of selected points. From each triangle we extract its shape feature along with trajectory based visual features of its points. We show that concatenating these features with SURF feature can form a spatio-temporal feature which is comparable to the state of the art. Our proposed spatio-temporal feature is supposed to be robust and informative since it is not based on characteristics of individual points but groups of related interest points. We apply Fisher Vector encoding to represent videos using the proposed feature. We conduct various experiments on UCF-101, the largest action dataset of realistic videos up to date, and show the effectiveness of our proposed method.

Cite

Text

Nga and Yanai. "A Spatio-Temporal Feature Based on Triangulation of Dense SURF." IEEE/CVF International Conference on Computer Vision Workshops, 2013. doi:10.1109/ICCVW.2013.62

Markdown

[Nga and Yanai. "A Spatio-Temporal Feature Based on Triangulation of Dense SURF." IEEE/CVF International Conference on Computer Vision Workshops, 2013.](https://mlanthology.org/iccvw/2013/nga2013iccvw-spatiotemporal/) doi:10.1109/ICCVW.2013.62

BibTeX

@inproceedings{nga2013iccvw-spatiotemporal,
  title     = {{A Spatio-Temporal Feature Based on Triangulation of Dense SURF}},
  author    = {Nga, Do Hang and Yanai, Keiji},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2013},
  pages     = {420-427},
  doi       = {10.1109/ICCVW.2013.62},
  url       = {https://mlanthology.org/iccvw/2013/nga2013iccvw-spatiotemporal/}
}