Active Scene Recognition with Vision and Language
Abstract
This paper presents a novel approach to utilizing high level knowledge for the problem of scene recognition in an active vision framework, which we call active scene recognition. In traditional approaches, high level knowledge is used in the post-processing to combine the outputs of the object detectors to achieve better classification performance. In contrast, the proposed approach employs high level knowledge actively by implementing an interaction between a reasoning module and a sensory module (Figure 1). Following this paradigm, we implemented an active scene recognizer and evaluated it with a dataset of 20 scenes and 100+ objects. We also extended it to the analysis of dynamic scenes for activity recognition with attributes. Experiments demonstrate the effectiveness of the active paradigm in introducing attention and additional constraints into the sensing process.
Cite
Text
Yu et al. "Active Scene Recognition with Vision and Language." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126320Markdown
[Yu et al. "Active Scene Recognition with Vision and Language." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/yu2011iccv-active/) doi:10.1109/ICCV.2011.6126320BibTeX
@inproceedings{yu2011iccv-active,
title = {{Active Scene Recognition with Vision and Language}},
author = {Yu, Xiaodong and Fermüller, Cornelia and Teo, Ching Lik and Yang, Yezhou and Aloimonos, Yiannis},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2011},
pages = {810-817},
doi = {10.1109/ICCV.2011.6126320},
url = {https://mlanthology.org/iccv/2011/yu2011iccv-active/}
}