Semantics Discovery for Image Indexing

Abstract

To bridge the gap between low-level features and high-level semantic queries in image retrieval, detecting meaningful visual entities (e.g. faces, sky, foliage, buildings etc) based on trained pattern classifiers has become an active research trend. However, a drawback of the supervised learning approach is the human effort to provide labeled regions as training samples. In this paper, we propose a new three-stage hybrid framework to discover local semantic patterns and generate their samples for training with minimal human intervention. Support vector machines (SVM) are first trained on local image blocks from a small number of images labeled as several semantic categories. Then to bootstrap the local semantics, image blocks that produce high SVM outputs are grouped into Discovered Semantic Regions (DSRs) using fuzzy c-means clustering. The training samples for these DSRs are automatically induced from cluster memberships and subject to support vector machine learning to form local semantic detectors for DSRs. An image is then indexed as a tessellation of DSR histograms and matched using histogram intersection. We evaluate our method against the linear fusion of color and texture features using 16 semantic queries on 2400 heterogeneous consumer photos. The DSR models achieved a promising 26% improvement in average precision over that of the feature fusion approach.

Cite

Text

Lim and Jin. "Semantics Discovery for Image Indexing." European Conference on Computer Vision, 2004. doi:10.1007/978-3-540-24670-1_21

Markdown

[Lim and Jin. "Semantics Discovery for Image Indexing." European Conference on Computer Vision, 2004.](https://mlanthology.org/eccv/2004/lim2004eccv-semantics/) doi:10.1007/978-3-540-24670-1_21

BibTeX

@inproceedings{lim2004eccv-semantics,
  title     = {{Semantics Discovery for Image Indexing}},
  author    = {Lim, Joo-Hwee and Jin, Jesse S.},
  booktitle = {European Conference on Computer Vision},
  year      = {2004},
  pages     = {270-281},
  doi       = {10.1007/978-3-540-24670-1_21},
  url       = {https://mlanthology.org/eccv/2004/lim2004eccv-semantics/}
}