Learning the Semantics of Words and Pictures

Abstract

We present a statistical model for organizing image collections which integrates semantic information provided by associate text and visual information provided by image features. The model is very promising for information retrieval tasks such as database browsing and searching for images based on text and/or image features. Furthermore, since the model learns relationships between text and image features, it can be used for novel applications such as associating words with pictures, and unsupervised learning for object recognition.

Cite

Text

Barnard and Forsyth. "Learning the Semantics of Words and Pictures." IEEE/CVF International Conference on Computer Vision, 2001. doi:10.1109/ICCV.2001.937654

Markdown

[Barnard and Forsyth. "Learning the Semantics of Words and Pictures." IEEE/CVF International Conference on Computer Vision, 2001.](https://mlanthology.org/iccv/2001/barnard2001iccv-learning/) doi:10.1109/ICCV.2001.937654

BibTeX

@inproceedings{barnard2001iccv-learning,
  title     = {{Learning the Semantics of Words and Pictures}},
  author    = {Barnard, Kobus and Forsyth, David A.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2001},
  pages     = {408-415},
  doi       = {10.1109/ICCV.2001.937654},
  url       = {https://mlanthology.org/iccv/2001/barnard2001iccv-learning/}
}