Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions
Abstract
Visual recognition of physical actions requires temporal segmentation and identification of action types. Action concepts are analyzed into attention, context, and change. Temporal segmentation is defined as a context switch detected by a switching of attention. Actions are identified by detecting “indexical” features which can be quickly calculated from visual features and directly point to action concepts. Validity of the indexicality depends on the attention and the context. These are maintained by three types of attention control: spatial, temporal and hierarchical. They are combined by a mechanism called “attention stack”, which extends at important points and winds up elsewhere. An action recognizer built upon the framework successfully recognized human assembly action sequences in real time and output qualitative descriptions of the tasks.
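As a toy illustration of the segmentation idea stated in the abstract (a segment boundary is placed wherever the attention target switches), the following minimal sketch splits a per-frame list of attention targets into segments. It is not the authors' implementation: the function name `segment_by_attention`, the target labels, and the assumption that attention is already available as one label per frame are all hypothetical.

```python
from itertools import groupby


def segment_by_attention(targets):
    """Split a per-frame sequence of attention targets into segments.

    Assumption (not from the paper): attention is given as one label per
    frame. A new segment starts whenever the label changes.
    Returns a list of (target, first_frame, last_frame) tuples.
    """
    segments = []
    start = 0
    for target, run in groupby(targets):
        length = sum(1 for _ in run)  # number of consecutive frames on this target
        segments.append((target, start, start + length - 1))
        start += length
    return segments


# Hypothetical attention trace for a short pick-and-place sequence.
frames = ["hand", "hand", "block_A", "block_A", "block_A", "hand", "block_B"]
segments = segment_by_attention(frames)
```

Each resulting tuple marks one stretch of constant attention. In the framework described above, the context and indexical features would then be evaluated within such a segment, which this sketch does not attempt to model.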
Cite
Text
Kuniyoshi and Inoue. "Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions." European Conference on Computer Vision, 1992. doi:10.1007/3-540-55426-2_101
Markdown
[Kuniyoshi and Inoue. "Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions." European Conference on Computer Vision, 1992.](https://mlanthology.org/eccv/1992/kuniyoshi1992eccv-indexicality/) doi:10.1007/3-540-55426-2_101
BibTeX
@inproceedings{kuniyoshi1992eccv-indexicality,
title = {{Indexicality and Dynamic Attention Control in Qualitative Recognition of Assembly Actions}},
author = {Kuniyoshi, Yasuo and Inoue, Hirochika},
booktitle = {European Conference on Computer Vision},
year = {1992},
pages = {874-878},
doi = {10.1007/3-540-55426-2_101},
url = {https://mlanthology.org/eccv/1992/kuniyoshi1992eccv-indexicality/}
}