Object Image Retrieval by Exploiting Online Knowledge Resources
Abstract
We describe a method to retrieve images found on web pages with specified object class labels, using an analysis of text around the image and of image appearance. Our method determines whether an object is both described in text and appears in a image using a discriminative image model and a generative text model. Our models are learnt by exploiting established online knowledge resources (Wikipedia pages for text; Flickr and Caltech data sets for image). These resources provide rich text and object appearance information. We describe results on two data sets. The first is Berg's collection of ten animal categories; on this data set, we outperform previous approaches [7, 33]. We have also collected five more categories. Experimental results show the effectiveness of our approach on this new data set.
Cite
Text
Wang and Forsyth. "Object Image Retrieval by Exploiting Online Knowledge Resources." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587818Markdown
[Wang and Forsyth. "Object Image Retrieval by Exploiting Online Knowledge Resources." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/wang2008cvpr-object/) doi:10.1109/CVPR.2008.4587818BibTeX
@inproceedings{wang2008cvpr-object,
title = {{Object Image Retrieval by Exploiting Online Knowledge Resources}},
author = {Wang, Gang and Forsyth, David A.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2008},
doi = {10.1109/CVPR.2008.4587818},
url = {https://mlanthology.org/cvpr/2008/wang2008cvpr-object/}
}