Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection

Abstract

This paper presents a novel descriptor for human detection in video sequence. It is referred to as spatial-temporal granularity -tunable gradients partition (STGGP), which is an extension of granularity-tunable gradients partition (GGP) from the still image domain to the spatial-temporal domain. Specifically, the moving human body is considered as a 3-dimensional entity in the spatial-temporal domain. Then in 3D Hough space, we define the generalized plane as a primitive to parse the structure of this 3D entity. The advantage of the generalized plane is that it can tolerate imperfect planes with certain level of uncertainty in rotation and translation. The robustness to the uncertainty is controlled quantitatively by the granularity parameters defined explicitly in the generalized plane. This property endows the STGGP descriptors versatile ability to represent both the deterministic structures and the statistical summarizations of the object. Moreover, the STGGP descriptor encodes much heterogeneous information such as the gradients’ strength, position, and distribution, as well as their temporal motion to enrich its representation ability. We evaluate the STGGP on human detection in sequence on the public datasets and very promising results have been achieved.

Cite

Text

Liu et al. "Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-15549-9_24

Markdown

[Liu et al. "Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/liu2010eccv-spatial/) doi:10.1007/978-3-642-15549-9_24

BibTeX

@inproceedings{liu2010eccv-spatial,
  title     = {{Spatial-Temporal Granularity-Tunable Gradients Partition (STGGP) Descriptors for Human Detection}},
  author    = {Liu, Yazhou and Shan, Shiguang and Chen, Xilin and Heikkilä, Janne and Gao, Wen and Pietikäinen, Matti},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {327-340},
  doi       = {10.1007/978-3-642-15549-9_24},
  url       = {https://mlanthology.org/eccv/2010/liu2010eccv-spatial/}
}