Unsupervised Learning of Hierarchical Spatial Structures in Images

Abstract

The visual world demonstrates organized spatial patterns, among objects or regions in a scene, object-parts in an object, and low-level features in object-parts. These classes of spatial structures are inherently hierarchical in nature. Although seemingly quite different these spatial patterns are simply manifestations of different levels in a hierarchy. In this work, we present a unified approach to unsupervised learning of hierarchical spatial structures from a collection of images. Ours is a hierarchical rule-based model capturing spatial patterns, where each rule is represented by a star-graph. We propose an unsupervised EM-style algorithm to learn our model from a collection of images. We show that the inference problem of determining the set of learnt rules instantiated in an image is equivalent to finding the minimum-cost Steiner tree in a directed acyclic graph. We evaluate our approach on a diverse set of data sets of object categories, natural outdoor scenes and images from complex street scenes with multiple objects.

Cite

Text

Parikh et al. "Unsupervised Learning of Hierarchical Spatial Structures in Images." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206549

Markdown

[Parikh et al. "Unsupervised Learning of Hierarchical Spatial Structures in Images." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/parikh2009cvpr-unsupervised/) doi:10.1109/CVPR.2009.5206549

BibTeX

@inproceedings{parikh2009cvpr-unsupervised,
  title     = {{Unsupervised Learning of Hierarchical Spatial Structures in Images}},
  author    = {Parikh, Devi and Zitnick, C. Lawrence and Chen, Tsuhan},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {2743-2750},
  doi       = {10.1109/CVPR.2009.5206549},
  url       = {https://mlanthology.org/cvpr/2009/parikh2009cvpr-unsupervised/}
}