Motion Patterns: High-Level Representation of Natural Video Sequences
Abstract
This work investigates the use of nonlinear dependencies in natural image sequence statistics to learn higher-order structures in natural videos. We propose a two-layer model that learns variance correlation between linear ICA coefficients and present a novel nonlinear representation of natural videos. The first layer performs a linear mapping from pixel values to ICA coefficients. In doing so, the spatiotemporal dynamics in natural videos are decomposed into a set of bases each encoding "independent motion." By assuming that the nonlinear dependency of ICA coefficients takes the form of variance correlation, the second layer learns the joint distribution of ICA sources that captures how these independent bases co-activate. Experimental results show that the abstract representation correspond to various activation patterns of bases with similar motion, hence the term "motion patterns." Our model offers a novel description of higher-order structures in natural videos. We illustrate the usefulness of the proposed representation in video segmentation and denoising tasks.
Cite
Text
Putthividhya and Lee. "Motion Patterns: High-Level Representation of Natural Video Sequences." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2006. doi:10.1109/CVPR.2006.191Markdown
[Putthividhya and Lee. "Motion Patterns: High-Level Representation of Natural Video Sequences." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2006.](https://mlanthology.org/cvpr/2006/putthividhya2006cvpr-motion/) doi:10.1109/CVPR.2006.191BibTeX
@inproceedings{putthividhya2006cvpr-motion,
title = {{Motion Patterns: High-Level Representation of Natural Video Sequences}},
author = {Putthividhya, Duangmanee and Lee, Te-Won},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2006},
pages = {1908-1915},
doi = {10.1109/CVPR.2006.191},
url = {https://mlanthology.org/cvpr/2006/putthividhya2006cvpr-motion/}
}