Exploiting Unlabeled Data in Content-Based Image Retrieval

Abstract

In this paper, the Ssair (Semi-Supervised Active Image Retrieval) approach, which attempts to exploit unlabeled data to improve the performance of content-based image retrieval ( Cbir ), is proposed. This approach combines the merits of semi-supervised learning and active learning. In detail, in each round of relevance feedback, two simple learners are trained from the labeled data, i.e. images from user query and user feedback. Each learner then classifies the unlabeled images in the database and passes the most relevant/irrelevant images to the other learner. After re-training with the additional labeled data, the learners classify the images in the database again and then their classifications are merged. Images judged to be relevant with high confidence are returned as the retrieval result, while these judged with low confidence are put into the pool which is used in the next round of relevance feedback. Experiments show that semi-supervised learning and active learning mechanisms are both beneficial to Cbir .

Cite

Text

Zhou et al. "Exploiting Unlabeled Data in Content-Based Image Retrieval." European Conference on Machine Learning, 2004. doi:10.1007/978-3-540-30115-8_48

Markdown

[Zhou et al. "Exploiting Unlabeled Data in Content-Based Image Retrieval." European Conference on Machine Learning, 2004.](https://mlanthology.org/ecmlpkdd/2004/zhou2004ecml-exploiting/) doi:10.1007/978-3-540-30115-8_48

BibTeX

@inproceedings{zhou2004ecml-exploiting,
  title     = {{Exploiting Unlabeled Data in Content-Based Image Retrieval}},
  author    = {Zhou, Zhi-Hua and Chen, Ke-Jia and Jiang, Yuan},
  booktitle = {European Conference on Machine Learning},
  year      = {2004},
  pages     = {525-536},
  doi       = {10.1007/978-3-540-30115-8_48},
  url       = {https://mlanthology.org/ecmlpkdd/2004/zhou2004ecml-exploiting/}
}