Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning

Abstract

This paper proposes a learning strategy that embeds object-part concepts into a pre-trained convolutional neural network (CNN), in an attempt to 1) explore explicit semantics hidden in CNN units and 2) gradually transform the pre-trained CNN into a semantically interpretable graphical model for hierarchical object understanding. Given part annotations on very few (e.g., 3-12) objects, our method mines certain latent patterns from the pre-trained CNN and associates them with different semantic parts. We use a four-layer And-Or graph to organize the CNN units, so as to clarify their internal semantic hierarchy. Our method is guided by a small number of part annotations, and it achieves superior part-localization performance (about 13%-107% improvement in part center prediction on the PASCAL VOC and ImageNet datasets)

Cite

Text

Zhang et al. "Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10924

Markdown

[Zhang et al. "Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/zhang2017aaai-growing/) doi:10.1609/AAAI.V31I1.10924

BibTeX

@inproceedings{zhang2017aaai-growing,
  title     = {{Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning}},
  author    = {Zhang, Quanshi and Cao, Ruiming and Wu, Ying Nian and Zhu, Song-Chun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {2898-2906},
  doi       = {10.1609/AAAI.V31I1.10924},
  url       = {https://mlanthology.org/aaai/2017/zhang2017aaai-growing/}
}