Integrating and Employing Multiple Levels of Zoom for Activity Recognition

Abstract

To facilitate activity recognition, analysis of the scene at multiple levels of detail is necessary. Required prerequisites for our activity recognition are tracking objects across frames and establishing a consistent labeling of objects across cameras. This paper makes several innovative uses of the epipolar constraint in the context of activity recognition. We first demonstrate how we track heads and hands using the epipolar geometry. Next we show how the detected objects are labeled consistently across cameras and zooms by employing epipolar, spatial, trajectory, and appearance properties. Finally we show how our method, utilizing the multiple levels of detail, is able to answer activity recognition problems which are difficult to answer with a single level of detail.

Cite

Text

Smith et al. "Integrating and Employing Multiple Levels of Zoom for Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2004. doi:10.1109/CVPR.2004.133

Markdown

[Smith et al. "Integrating and Employing Multiple Levels of Zoom for Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2004.](https://mlanthology.org/cvpr/2004/smith2004cvpr-integrating/) doi:10.1109/CVPR.2004.133

BibTeX

@inproceedings{smith2004cvpr-integrating,
  title     = {{Integrating and Employing Multiple Levels of Zoom for Activity Recognition}},
  author    = {Smith, Paul and Shah, Mubarak and da Vitoria Lobo, Niels},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2004},
  pages     = {928-935},
  doi       = {10.1109/CVPR.2004.133},
  url       = {https://mlanthology.org/cvpr/2004/smith2004cvpr-integrating/}
}