Pose Filter Based Hidden-CRF Models for Activity Detection
Abstract
Detecting activities which involve a sequence of complex pose and motion changes in unsegmented videos is a challenging task, and common approaches use sequential graphical models to infer the human pose-state in every frame. We propose an alternative model based on detecting the key-poses in a video, where only the temporal positions of a few key-poses are inferred. We also introduce a novel pose summarization algorithm to automatically discover the key-poses of an activity. We learn a detection filter for each key-pose, which along with a bag-of-words root filter are combined in an HCRF model, whose parameters are learned using the latent-SVM optimization. We evaluate the performance of our model for detection on unsegmented videos on four human action datasets, which include challenging crowded scenes with dynamic backgrounds, inter-person occlusions, multi-human interactions and hard-to-detect daily use objects.
Cite
Text
Banerjee and Nevatia. "Pose Filter Based Hidden-CRF Models for Activity Detection." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10605-2_46Markdown
[Banerjee and Nevatia. "Pose Filter Based Hidden-CRF Models for Activity Detection." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/banerjee2014eccv-pose/) doi:10.1007/978-3-319-10605-2_46BibTeX
@inproceedings{banerjee2014eccv-pose,
title = {{Pose Filter Based Hidden-CRF Models for Activity Detection}},
author = {Banerjee, Prithviraj and Nevatia, Ramakant},
booktitle = {European Conference on Computer Vision},
year = {2014},
pages = {711-726},
doi = {10.1007/978-3-319-10605-2_46},
url = {https://mlanthology.org/eccv/2014/banerjee2014eccv-pose/}
}