Learning Probabilistic Structure for Human Motion Detection
Abstract
Decomposable triangulated graphs have been shown to be efficient and effective for modeling the probabilistic spatio-temporal structure of brief stretches of human motion. In previous work such model structure was handcrafted by expert human observers and labeled data were needed for parameter learning. We present a method to build automatically the structure of the decomposable triangulated graph from unlabeled data. It is based on maximum-likelihood. Taking the labeling of the data as hidden variables, a variant of the EM algorithm can be applied. A greedy algorithm is developed to search for the optimal structure of the decomposable model based on the (conditional) differential entropy of variables. Our algorithm is demonstrated by learning models of human motion completely automatically from unlabeled real image sequences with clutter and occlusion. Experiments on both motion captured data and grayscale image sequences show that the resulting models perform better than the hand-constructed models.
Cite
Text
Song et al. "Learning Probabilistic Structure for Human Motion Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001. doi:10.1109/CVPR.2001.991043Markdown
[Song et al. "Learning Probabilistic Structure for Human Motion Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2001.](https://mlanthology.org/cvpr/2001/song2001cvpr-learning/) doi:10.1109/CVPR.2001.991043BibTeX
@inproceedings{song2001cvpr-learning,
title = {{Learning Probabilistic Structure for Human Motion Detection}},
author = {Song, Yang and Goncalves, Luis and Perona, Pietro},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2001},
pages = {II:771-777},
doi = {10.1109/CVPR.2001.991043},
url = {https://mlanthology.org/cvpr/2001/song2001cvpr-learning/}
}