Learning Probabilistic Structure for Human Motion Detection

Abstract

Decomposable triangulated graphs have been shown to be efficient and effective for modeling the probabilistic spatio-temporal structure of brief stretches of human motion. In previous work, the model structure was handcrafted by expert human observers, and labeled data were needed for parameter learning. We present a method that automatically builds the structure of the decomposable triangulated graph from unlabeled data. The method is based on maximum likelihood: treating the labeling of the data as hidden variables, a variant of the EM algorithm can be applied. A greedy algorithm is developed to search for the optimal structure of the decomposable model based on the (conditional) differential entropy of variables. Our algorithm is demonstrated by learning models of human motion completely automatically from unlabeled real image sequences with clutter and occlusion. Experiments on both motion-capture data and grayscale image sequences show that the resulting models perform better than the hand-constructed models.
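The greedy, entropy-driven structure search mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each variable is a scalar, that the variables are jointly Gaussian with a known covariance matrix, and that the labeling is already known, so the EM step is omitted. The function names `log_det`, `gaussian_entropy`, and `greedy_triangulated_structure` are hypothetical. The sketch starts from the triangle with the lowest joint differential entropy, then repeatedly attaches the remaining vertex A to the existing edge (B, C) that minimizes h(A | B, C) = h(A, B, C) - h(B, C).

```python
import itertools
import math


def log_det(matrix):
    """Log-determinant of a small symmetric positive-definite matrix,
    computed by Gaussian elimination without pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    total = 0.0
    for k in range(n):
        pivot = a[k][k]
        if pivot <= 0.0:
            raise ValueError("covariance submatrix is not positive definite")
        total += math.log(pivot)
        for i in range(k + 1, n):
            factor = a[i][k] / pivot
            for j in range(k, n):
                a[i][j] -= factor * a[k][j]
    return total


def gaussian_entropy(cov, idx):
    """Differential entropy (in nats) of the Gaussian marginal over idx."""
    idx = list(idx)
    sub = [[cov[i][j] for j in idx] for i in idx]
    return 0.5 * (len(idx) * math.log(2.0 * math.pi * math.e) + log_det(sub))


def greedy_triangulated_structure(cov):
    """Greedily grow a decomposable triangulated graph (assumed scheme).

    Returns the list of triangles (A, B, C) in the order they were added
    and the summed entropy cost of the resulting model.
    """
    n = len(cov)
    if n < 3:
        raise ValueError("need at least three variables")

    # Seed with the triangle of minimum joint entropy.
    first = min(itertools.combinations(range(n), 3),
                key=lambda t: gaussian_entropy(cov, t))
    triangles = [first]
    edges = set(itertools.combinations(first, 2))
    remaining = set(range(n)) - set(first)
    total = gaussian_entropy(cov, first)

    while remaining:
        best = None
        for a in sorted(remaining):
            for b, c in sorted(edges):
                # Conditional entropy h(A | B, C) of attaching a to edge (b, c).
                h = gaussian_entropy(cov, (a, b, c)) - gaussian_entropy(cov, (b, c))
                if best is None or h < best[0]:
                    best = (h, a, b, c)
        h, a, b, c = best
        triangles.append((a, b, c))
        edges.add(tuple(sorted((a, b))))
        edges.add(tuple(sorted((a, c))))
        remaining.remove(a)
        total += h

    return triangles, total
```

Under these assumptions, a graph over n variables ends up with n - 2 triangles, and the returned total can never fall below the joint entropy of all variables, because each new vertex is conditioned on only two of the vertices already placed. A lower total corresponds to a higher expected Gaussian log-likelihood for the chosen structure.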

Cite

Text

Song et al. "Learning Probabilistic Structure for Human Motion Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001. doi:10.1109/CVPR.2001.991043

Markdown

[Song et al. "Learning Probabilistic Structure for Human Motion Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001.](https://mlanthology.org/cvpr/2001/song2001cvpr-learning/) doi:10.1109/CVPR.2001.991043

BibTeX

@inproceedings{song2001cvpr-learning,
  title     = {{Learning Probabilistic Structure for Human Motion Detection}},
  author    = {Song, Yang and Goncalves, Luis and Perona, Pietro},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2001},
  pages     = {II:771-777},
  doi       = {10.1109/CVPR.2001.991043},
  url       = {https://mlanthology.org/cvpr/2001/song2001cvpr-learning/}
}