SST: Single-Stream Temporal Action Proposals

Abstract

Our paper presents a new approach for temporal detection of human actions in long, untrimmed video sequences. We introduce Single-Stream Temporal Action Proposals (SST), an effective and efficient deep architecture for the generation of temporal action proposals. Our network can run continuously in a single stream over very long input video sequences, without the need to divide the input into short overlapping clips or temporal windows for batch processing. We demonstrate empirically that our model outperforms the state of the art on the task of temporal action proposal generation, while achieving some of the fastest processing speeds in the literature. Finally, we demonstrate that using SST proposals in conjunction with existing action classifiers results in improved state-of-the-art temporal action detection performance.

Cite

Text

Buch et al. "SST: Single-Stream Temporal Action Proposals." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.675

Markdown

[Buch et al. "SST: Single-Stream Temporal Action Proposals." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/buch2017cvpr-sst/) doi:10.1109/CVPR.2017.675

BibTeX

@inproceedings{buch2017cvpr-sst,
  title     = {{SST: Single-Stream Temporal Action Proposals}},
  author    = {Buch, Shyamal and Escorcia, Victor and Shen, Chuanqi and Ghanem, Bernard and Niebles, Juan Carlos},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.675},
  url       = {https://mlanthology.org/cvpr/2017/buch2017cvpr-sst/}
}