Learning Semantic Neural Tree for Human Parsing

Abstract

In this paper, we design a novel semantic neural tree for human parsing, which uses a tree architecture to encode physiological structure of human body, and design a coarse to fine process in a cascade manner to generate accurate results. Specifically, the semantic neural tree is designed to segment human regions into multiple semantic sub-regions (g, face, arms, and legs) in a hierarchical way using a new designed attention routing module. Meanwhile, we introduce the semantic aggregation module to combine multiple hierarchical features to exploit more context information for better performance. Our semantic neural tree can be trained in an end-to-end fashion by standard stochastic gradient descent (SGD) with back-propagation. Several experiments conducted on four challenging datasets for both single and multiple human parsing, \ie, LIP, PASCAL-Person-Part, CIHP and MHP-v2, demonstrate the effectiveness of the proposed method. Code can be found at \url{https://isrc.iscas.ac.cn/gitlab/research/sematree}.

Cite

Text

Ji et al. "Learning Semantic Neural Tree for Human Parsing." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58601-0_13

Markdown

[Ji et al. "Learning Semantic Neural Tree for Human Parsing." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/ji2020eccv-learning/) doi:10.1007/978-3-030-58601-0_13

BibTeX

@inproceedings{ji2020eccv-learning,
  title     = {{Learning Semantic Neural Tree for Human Parsing}},
  author    = {Ji, Ruyi and Du, Dawei and Zhang, Libo and Wen, Longyin and Wu, Yanjun and Zhao, Chen and Huang, Feiyue and Lyu, Siwei},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58601-0_13},
  url       = {https://mlanthology.org/eccv/2020/ji2020eccv-learning/}
}