Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints

Abstract

Adopting the Counting Grid (CG) representation [1], the Spring Lattice Counting Grid (SLCG) model uses a grid of feature counts to capture the spatial layout that a variety of images tend to follow. The images are mapped to the counting grid with their features rearranged so as to strike a balance between the mapping quality and the extent of the necessary rearrangement. In particular, the feature sets originating from different image sectors are mapped to different sub-windows in the counting grid in a configuration that is close, but not exactly the same as the configuration of the source sectors. The distribution over deformations of the sector configuration is learnable using a new spring lattice model, while the rearrangement of features within a sector is unconstrained. As a result, the CG model gains a more appropriate level of invariance to realistic image transformations like view point changes, rotations or scales. We tested SLCG on standard scene recognition datasets and on a dataset collected with a wearable camera which recorded the wearer’s visual input over three weeks. Our algorithm is capable of correctly classifying the visited locations more than 80% of the time, outperforming previous approaches to visual location recognition. At this level of performance, a variety of real-world applications of wearable cameras become feasible.

Cite

Text

Perina and Jojic. "Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33783-3_60

Markdown

[Perina and Jojic. "Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/perina2012eccv-spring/) doi:10.1007/978-3-642-33783-3_60

BibTeX

@inproceedings{perina2012eccv-spring,
  title     = {{Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints}},
  author    = {Perina, Alessandro and Jojic, Nebojsa},
  booktitle = {European Conference on Computer Vision},
  year      = {2012},
  pages     = {837-851},
  doi       = {10.1007/978-3-642-33783-3_60},
  url       = {https://mlanthology.org/eccv/2012/perina2012eccv-spring/}
}