Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions

Abstract

Visual recognition of physical actions requires temporal segmentation and identification of action types. Action concepts are analyzed into attention, context, and change. Temporal segmentation is defined as a context switch detected by a switching of attention. Actions are identified by detecting “indexical” features which can be quickly calculated from visual features and directly point to action concepts. Validity of the indexicality depends on the attention and the context. These are maintained by three types of attention control: spatial, temporal and hierarchical. They are combined by a mechanism called “attention stack”, which extends at important points and winds up elsewhere. An action recognizer built upon the framework successfully recognized human assembly action sequences in real time and output qualitative descriptions of the tasks.

Cite

Text

Kuniyoshi and Inoue. "Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions." European Conference on Computer Vision, 1992. doi:10.1007/3-540-55426-2_101

Markdown

[Kuniyoshi and Inoue. "Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions." European Conference on Computer Vision, 1992.](https://mlanthology.org/eccv/1992/kuniyoshi1992eccv-indexicality/) doi:10.1007/3-540-55426-2_101

BibTeX

@inproceedings{kuniyoshi1992eccv-indexicality,
  title     = {{Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions}},
  author    = {Kuniyoshi, Yasuo and Inoue, Hirochika},
  booktitle = {European Conference on Computer Vision},
  year      = {1992},
  pages     = {874--878},
  doi       = {10.1007/3-540-55426-2_101},
  url       = {https://mlanthology.org/eccv/1992/kuniyoshi1992eccv-indexicality/}
}