Nested Motion Descriptors

Abstract

A nested motion descriptor is a spatiotemporal representation of motion that is invariant to global camera translation, without requiring an explicit estimate of optical flow or camera stabilization. This descriptor is a natural spatiotemporal extension of the nested shape descriptor to the representation of motion. We demonstrate that the quadrature steerable pyramid can be used to pool phase, and that pooling phase rather than magnitude provides an estimate of camera motion. This motion can be removed using the log-spiral normalization as introduced in the nested shape descriptor. Furthermore, this structure enables an elegant visualization of salient motion using the reconstruction properties of the steerable pyramid. We compare our descriptor to local motion descriptors, HOG-3D and HOG-HOF, and show improvements on three activity recognition datasets.

Cite

Text

Byrne. "Nested Motion Descriptors." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7298648

Markdown

[Byrne. "Nested Motion Descriptors." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/byrne2015cvpr-nested/) doi:10.1109/CVPR.2015.7298648

BibTeX

@inproceedings{byrne2015cvpr-nested,
  title     = {{Nested Motion Descriptors}},
  author    = {Byrne, Jeffrey},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7298648},
  url       = {https://mlanthology.org/cvpr/2015/byrne2015cvpr-nested/}
}