Scene Classification with Low-Dimensional Semantic Spaces and Weak Supervision
Abstract
A novel approach to scene categorization is proposed. Similar to previous works of [11, 15, 3, 12], we introduce an intermediate space, based on a low dimensional semantic ldquothemerdquo image representation. However, instead of learning the themes in an unsupervised manner, they are learned with weak supervision, from casual image annotations. Each theme induces a probability density on the space of low-level features, and images are represented as vectors of posterior theme probabilities. This enables an image to be associated with multiple themes, even when there are no multiple associations in the training labels. An implementation is presented and compared to various existing algorithms, on benchmark datasets. It is shown that the proposed low dimensional representation correlates well with human scene understanding, and is able to learn theme co-occurrences without explicit training. It is also shown to outperform unsupervised latent-space methods, with much smaller training complexity, and to achieve performance close to the state of the art methods, which rely on much higher-dimensional image representations. Finally a study of the effect of dimensionality on the classification performance is presented, indicating that the dimensionality of theme space grows sub-linearly with the number of scene categories.
Cite
Text
Rasiwasia and Vasconcelos. "Scene Classification with Low-Dimensional Semantic Spaces and Weak Supervision." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587372Markdown
[Rasiwasia and Vasconcelos. "Scene Classification with Low-Dimensional Semantic Spaces and Weak Supervision." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/rasiwasia2008cvpr-scene/) doi:10.1109/CVPR.2008.4587372BibTeX
@inproceedings{rasiwasia2008cvpr-scene,
title = {{Scene Classification with Low-Dimensional Semantic Spaces and Weak Supervision}},
author = {Rasiwasia, Nikhil and Vasconcelos, Nuno},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2008},
doi = {10.1109/CVPR.2008.4587372},
url = {https://mlanthology.org/cvpr/2008/rasiwasia2008cvpr-scene/}
}