Learning Unseen Concepts via Hierarchical Decomposition and Composition

Abstract

Composing and recognizing new concepts from known sub-concepts has been a fundamental and challenging vision task, mainly due to 1) the diversity of sub-concepts and 2) the intricate contextuality between sub-concepts and their corresponding visual features. However, most of the current methods simply treat the contextuality as rigid semantic relationships and fail to capture fine-grained contextual correlations. We propose to learn unseen concepts in a hierarchical decomposition-and-composition manner. Considering the diversity of sub-concepts, our method decomposes each seen image into visual elements according to its labels, and learns corresponding sub-concepts in their individual subspaces. To model intricate contextuality between sub-concepts and their visual features, compositions are generated from these subspaces in three hierarchical forms, and the composed concepts are learned in a unified composition space. To further refine the captured contextual relationships, adaptively semi-positive concepts are defined and then learned with pseudo supervision exploited from the generated compositions. We validate the proposed approach on two challenging benchmarks, and demonstrate its superiority over state-of-the-art approaches.

Cite

Text

Yang et al. "Learning Unseen Concepts via Hierarchical Decomposition and Composition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01026

Markdown

[Yang et al. "Learning Unseen Concepts via Hierarchical Decomposition and Composition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/yang2020cvpr-learning-e/) doi:10.1109/CVPR42600.2020.01026

BibTeX

@inproceedings{yang2020cvpr-learning-e,
  title     = {{Learning Unseen Concepts via Hierarchical Decomposition and Composition}},
  author    = {Yang, Muli and Deng, Cheng and Yan, Junchi and Liu, Xianglong and Tao, Dacheng},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01026},
  url       = {https://mlanthology.org/cvpr/2020/yang2020cvpr-learning-e/}
}