Localizing Actions Through Sequential 2D Video Projections

Abstract

Action detection in video is a particularly difficult problem because actions must not only be recognized correctly, but must also be localized in the 3D spatio-temporal volume. This paper introduces a technique that transforms the 3D localization problem into a series of 2D detection tasks. This is accomplished by dividing the video into overlapping segments, then representing each segment with a 2D video projection. The advantage of the 2D projection is that it makes it convenient to apply the best techniques from object detection to the action detection problem. Our experiments show that video projection outperforms the latest results on action detection in a direct comparison.
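The segment-then-project idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a segment is projected to 2D, so the per-pixel temporal mean used here is only a stand-in assumption, not the authors' projection. The function names `segment_bounds` and `project_segments` and the parameters `seg_len` and `stride` are likewise hypothetical.

```python
import numpy as np


def segment_bounds(num_frames, seg_len, stride):
    """Return (start, end) frame indices of overlapping segments (end exclusive)."""
    if num_frames <= 0 or seg_len <= 0 or stride <= 0:
        raise ValueError("num_frames, seg_len and stride must be positive")
    if num_frames <= seg_len:
        # The video is shorter than one segment: use it as a single segment.
        return [(0, num_frames)]
    starts = list(range(0, num_frames - seg_len + 1, stride))
    # Add a final segment if the regular stride leaves trailing frames uncovered.
    if starts[-1] + seg_len < num_frames:
        starts.append(num_frames - seg_len)
    return [(s, s + seg_len) for s in starts]


def project_segments(video, seg_len, stride):
    """Collapse each segment of a (T, H, W) video into one (H, W) image.

    ASSUMPTION: the temporal mean is a placeholder projection chosen for
    illustration; the source does not specify the projection operator.
    """
    video = np.asarray(video, dtype=np.float64)
    return [
        (start, end, video[start:end].mean(axis=0))
        for start, end in segment_bounds(video.shape[0], seg_len, stride)
    ]


if __name__ == "__main__":
    # Toy grayscale video: 10 frames of 2x3 pixels.
    toy_video = np.arange(10 * 2 * 3).reshape(10, 2, 3)
    for start, end, image in project_segments(toy_video, seg_len=4, stride=2):
        print(start, end, image.shape)
```

Under this sketch, any 2D detector run on a projected image would yield a spatial box, while the segment's `(start, end)` pair supplies the temporal extent; how the paper combines detections across segments is not stated in the abstract.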

Cite

Text

Boyraz et al. "Localizing Actions Through Sequential 2D Video Projections." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2011. doi:10.1109/CVPRW.2011.5981747

Markdown

[Boyraz et al. "Localizing Actions Through Sequential 2D Video Projections." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2011.](https://mlanthology.org/cvprw/2011/boyraz2011cvprw-localizing/) doi:10.1109/CVPRW.2011.5981747

BibTeX

@inproceedings{boyraz2011cvprw-localizing,
  title     = {{Localizing Actions Through Sequential 2D Video Projections}},
  author    = {Boyraz, Hakan and Tappen, Marshall F. and Sukthankar, Rahul},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2011},
  pages     = {34--39},
  doi       = {10.1109/CVPRW.2011.5981747},
  url       = {https://mlanthology.org/cvprw/2011/boyraz2011cvprw-localizing/}
}