Human Detection Using Learned Part Alphabet and Pose Dictionary

Abstract

As structured data, human body and text are similar in many aspects. In this paper, we make use of the analogy between human body and text to build a compositional model for human detection in natural scenes. Basic concepts and mature techniques in text recognition are introduced into this model. A discriminative alphabet, each grapheme of which is a mid-level element representing a body part, is automatically learned from bounding box labels. Based on this alphabet, the flexible structure of human body is expressed by means of symbolic sequences, which correspond to various human poses and allow for robust, efficient matching. A pose dictionary is constructed from training examples, which is used to verify hypotheses at runtime. Experiments on standard benchmarks demonstrate that the proposed algorithm achieves state-of-the-art or competitive performance.

Cite

Text

Yao et al. "Human Detection Using Learned Part Alphabet and Pose Dictionary." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10602-1_17

Markdown

[Yao et al. "Human Detection Using Learned Part Alphabet and Pose Dictionary." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/yao2014eccv-human/) doi:10.1007/978-3-319-10602-1_17

BibTeX

@inproceedings{yao2014eccv-human,
  title     = {{Human Detection Using Learned Part Alphabet and Pose Dictionary}},
  author    = {Yao, Cong and Bai, Xiang and Liu, Wenyu and Latecki, Longin Jan},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {251-266},
  doi       = {10.1007/978-3-319-10602-1_17},
  url       = {https://mlanthology.org/eccv/2014/yao2014eccv-human/}
}