Object Image Retrieval by Exploiting Online Knowledge Resources

Abstract

We describe a method to retrieve images found on web pages with specified object class labels, using an analysis of text around the image and of image appearance. Our method determines whether an object is both described in text and appears in a image using a discriminative image model and a generative text model. Our models are learnt by exploiting established online knowledge resources (Wikipedia pages for text; Flickr and Caltech data sets for image). These resources provide rich text and object appearance information. We describe results on two data sets. The first is Berg's collection of ten animal categories; on this data set, we outperform previous approaches [7, 33]. We have also collected five more categories. Experimental results show the effectiveness of our approach on this new data set.

Cite

Text

Wang and Forsyth. "Object Image Retrieval by Exploiting Online Knowledge Resources." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587818

Markdown

[Wang and Forsyth. "Object Image Retrieval by Exploiting Online Knowledge Resources." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/wang2008cvpr-object/) doi:10.1109/CVPR.2008.4587818

BibTeX

@inproceedings{wang2008cvpr-object,
  title     = {{Object Image Retrieval by Exploiting Online Knowledge Resources}},
  author    = {Wang, Gang and Forsyth, David A.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587818},
  url       = {https://mlanthology.org/cvpr/2008/wang2008cvpr-object/}
}