Q-Learning of Sequential Attention for Visual Object Recognition from Informative Local Descriptors

Paletta, Lucas; Fritz, Gerald; Seifert, Christin

doi:10.1145/1102351.1102433

Q-Learning of Sequential Attention for Visual Object Recognition from Informative Local Descriptors

Lucas Paletta, Gerald Fritz, Christin Seifert

ICML 2005 pp. 649-656

doi:10.1145/1102351.1102433 /icml/2005/paletta2005icml-q/

Abstract

This work provides a framework for learning sequential attention in real-world visual object recognition, using an architecture of three processing stages. The first stage rejects irrelevant local descriptors based on an information theoretic saliency measure, providing candidates for foci of interest (FOI). The second stage investigates the information in the FOI using a codebook matcher and providing weak object hypotheses. The third stage integrates local information via shifts of attention, resulting in chains of descriptor-action pairs that characterize object discrimination. A Q-learner adapts then from explorative search and evaluative feedback from entropy decreases on the attention sequences, eventually prioritizing shifts that lead to a geometry of descriptor-action scanpaths that is highly discriminative with respect to object recognition. The methodology is successfully evaluated on indoors (COIL-20 database) and outdoors (TSG-20 database) imagery, demonstrating significant impact by learning, outperforming standard local descriptor based methods both in recognition accuracy and processing time.

PDF ICML Semantic Scholar

Cite

Text

Paletta et al. "Q-Learning of Sequential Attention for Visual Object Recognition from Informative Local Descriptors." International Conference on Machine Learning, 2005. doi:10.1145/1102351.1102433

Markdown

[Paletta et al. "Q-Learning of Sequential Attention for Visual Object Recognition from Informative Local Descriptors." International Conference on Machine Learning, 2005.](https://mlanthology.org/icml/2005/paletta2005icml-q/) doi:10.1145/1102351.1102433

BibTeX

@inproceedings{paletta2005icml-q,
  title     = {{Q-Learning of Sequential Attention for Visual Object Recognition from Informative Local Descriptors}},
  author    = {Paletta, Lucas and Fritz, Gerald and Seifert, Christin},
  booktitle = {International Conference on Machine Learning},
  year      = {2005},
  pages     = {649-656},
  doi       = {10.1145/1102351.1102433},
  url       = {https://mlanthology.org/icml/2005/paletta2005icml-q/}
}