Localizing Actions Through Sequential 2D Video Projections
Abstract
Action detection in video is a particularly difficult problem because actions must not only be recognized correctly, but must also be localized in the 3D spatio-temporal volume. This paper introduces a technique that transforms the 3D localization problem into a series of 2D detection tasks. This is accomplished by dividing the video into overlapping segments, then representing each segment with a 2D video projection. The advantage of the 2D projection is that it makes it convenient to apply the best techniques from object detection to the action detection problem. Our experiments show that video projection outperforms the latest results on action detection in a direct comparison. 1.
Cite
Text
Boyraz et al. "Localizing Actions Through Sequential 2D Video Projections." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2011. doi:10.1109/CVPRW.2011.5981747Markdown
[Boyraz et al. "Localizing Actions Through Sequential 2D Video Projections." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2011.](https://mlanthology.org/cvprw/2011/boyraz2011cvprw-localizing/) doi:10.1109/CVPRW.2011.5981747BibTeX
@inproceedings{boyraz2011cvprw-localizing,
title = {{Localizing Actions Through Sequential 2D Video Projections}},
author = {Boyraz, Hakan and Tappen, Marshall F. and Sukthankar, Rahul},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2011},
pages = {34-39},
doi = {10.1109/CVPRW.2011.5981747},
url = {https://mlanthology.org/cvprw/2011/boyraz2011cvprw-localizing/}
}