Harvesting Large-Scale Weakly-Tagged Image Databases from the Web

Abstract

To leverage large-scale weakly-tagged images for computer vision tasks (such as object detection and scene recognition), a novel cross-modal tag cleansing and junk image filtering algorithm is developed for cleansing the weakly-tagged images and their social tags (i.e., removing irrelevant images and finding the most relevant tags for each image) by integrating both the visual similarity contexts between the images and the semantic similarity contexts between their tags. Our algorithm can address the issues of spams, polysemes and synonyms more effectively and determine the relevance between the images and their social tags more precisely, thus it can allow us to create large amounts of training images with more reliable labels by harvesting from large-scale weakly-tagged images, which can further be used to achieve more effective classifier training for many computer vision tasks.

Cite

Text

Fan et al. "Harvesting Large-Scale Weakly-Tagged Image Databases from the Web." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5540135

Markdown

[Fan et al. "Harvesting Large-Scale Weakly-Tagged Image Databases from the Web." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/fan2010cvpr-harvesting/) doi:10.1109/CVPR.2010.5540135

BibTeX

@inproceedings{fan2010cvpr-harvesting,
  title     = {{Harvesting Large-Scale Weakly-Tagged Image Databases from the Web}},
  author    = {Fan, Jianping and Shen, Yi and Zhou, Ning and Gao, Yuli},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2010},
  pages     = {802-809},
  doi       = {10.1109/CVPR.2010.5540135},
  url       = {https://mlanthology.org/cvpr/2010/fan2010cvpr-harvesting/}
}