Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints
Abstract
Adopting the Counting Grid (CG) representation [1], the Spring Lattice Counting Grid (SLCG) model uses a grid of feature counts to capture the spatial layout that a variety of images tend to follow. The images are mapped to the counting grid with their features rearranged so as to strike a balance between the mapping quality and the extent of the necessary rearrangement. In particular, the feature sets originating from different image sectors are mapped to different sub-windows in the counting grid in a configuration that is close, but not exactly the same as the configuration of the source sectors. The distribution over deformations of the sector configuration is learnable using a new spring lattice model, while the rearrangement of features within a sector is unconstrained. As a result, the CG model gains a more appropriate level of invariance to realistic image transformations like view point changes, rotations or scales. We tested SLCG on standard scene recognition datasets and on a dataset collected with a wearable camera which recorded the wearer’s visual input over three weeks. Our algorithm is capable of correctly classifying the visited locations more than 80% of the time, outperforming previous approaches to visual location recognition. At this level of performance, a variety of real-world applications of wearable cameras become feasible.
Cite
Text
Perina and Jojic. "Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints." European Conference on Computer Vision, 2012. doi:10.1007/978-3-642-33783-3_60Markdown
[Perina and Jojic. "Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints." European Conference on Computer Vision, 2012.](https://mlanthology.org/eccv/2012/perina2012eccv-spring/) doi:10.1007/978-3-642-33783-3_60BibTeX
@inproceedings{perina2012eccv-spring,
title = {{Spring Lattice Counting Grids: Scene Recognition Using Deformable Positional Constraints}},
author = {Perina, Alessandro and Jojic, Nebojsa},
booktitle = {European Conference on Computer Vision},
year = {2012},
pages = {837-851},
doi = {10.1007/978-3-642-33783-3_60},
url = {https://mlanthology.org/eccv/2012/perina2012eccv-spring/}
}