Internet Video Category Recognition
Abstract
In this paper, we examine the problem of internet video categorization. Specifically, we explore the representation of a video as a ldquobag of wordsrdquo using various combinations of spatial and temporal descriptors. The descriptors incorporate both spatial and temporal gradients as well as optical flow information. We achieve state-of-the-art results on a standard human activity recognition database and demonstrate promising category recognition performance on two new databases of approximately 1000 and 1500 online user-submitted videos, which we will be making available to the community.
Cite
Text
Schindler et al. "Internet Video Category Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2008. doi:10.1109/CVPRW.2008.4562960Markdown
[Schindler et al. "Internet Video Category Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2008.](https://mlanthology.org/cvprw/2008/schindler2008cvprw-internet/) doi:10.1109/CVPRW.2008.4562960BibTeX
@inproceedings{schindler2008cvprw-internet,
title = {{Internet Video Category Recognition}},
author = {Schindler, Grant and Zitnick, Larry and Brown, Matthew A.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2008},
pages = {1-7},
doi = {10.1109/CVPRW.2008.4562960},
url = {https://mlanthology.org/cvprw/2008/schindler2008cvprw-internet/}
}