Learning the Semantics of Words and Pictures
Abstract
We present a statistical model for organizing image collections which integrates semantic information provided by associate text and visual information provided by image features. The model is very promising for information retrieval tasks such as database browsing and searching for images based on text and/or image features. Furthermore, since the model learns relationships between text and image features, it can be used for novel applications such as associating words with pictures, and unsupervised learning for object recognition.
Cite
Text
Barnard and Forsyth. "Learning the Semantics of Words and Pictures." IEEE/CVF International Conference on Computer Vision, 2001. doi:10.1109/ICCV.2001.937654Markdown
[Barnard and Forsyth. "Learning the Semantics of Words and Pictures." IEEE/CVF International Conference on Computer Vision, 2001.](https://mlanthology.org/iccv/2001/barnard2001iccv-learning/) doi:10.1109/ICCV.2001.937654BibTeX
@inproceedings{barnard2001iccv-learning,
title = {{Learning the Semantics of Words and Pictures}},
author = {Barnard, Kobus and Forsyth, David A.},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2001},
pages = {408-415},
doi = {10.1109/ICCV.2001.937654},
url = {https://mlanthology.org/iccv/2001/barnard2001iccv-learning/}
}