Hierarchy of Alternating Specialists for Scene Recognition

Abstract

We introduce a method for improving convolutional neural networks (CNNs) for scene classification. We present a hierarchy of specialist networks, which disentangles the intra-class variation and inter-class similarity in a coarse to fine manner. Our key insight is that each subset within a class is often associated with different types of inter-class similarity. This suggests that existing network of experts approaches that organize classes into coarse categories are suboptimal. In contrast, we group images based on high-level appearance features rather than their class membership and dedicate a specialist model per group. In addition, we propose an alternating architecture with a global ordered- and a global orderless-representation to account for both the coarse layout of the scene and the transient objects. We demonstrate that it leads to better performance than using a single type of representation as well as the fused features. We also introduce a mini-batch soft k-means that allows end-to-end fine-tuning, as well as a novel routing function for assigning images to specialists. Experimental results show that the proposed approach achieves a significant improvement over baselines including the existing tree-structured CNNs with class-based grouping.

Cite

Text

Jin Kim and Frahm. "Hierarchy of Alternating Specialists for Scene Recognition." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01252-6_28

Markdown

[Jin Kim and Frahm. "Hierarchy of Alternating Specialists for Scene Recognition." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/jinkim2018eccv-hierarchy/) doi:10.1007/978-3-030-01252-6_28

BibTeX

@inproceedings{jinkim2018eccv-hierarchy,
  title     = {{Hierarchy of Alternating Specialists for Scene Recognition}},
  author    = {Jin Kim, Hyo and Frahm, Jan-Michael},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2018},
  doi       = {10.1007/978-3-030-01252-6_28},
  url       = {https://mlanthology.org/eccv/2018/jinkim2018eccv-hierarchy/}
}