Unsupervised Learning of Visual Sense Models for Polysemous Words
Abstract
Polysemy is a problem for methods that exploit image search engines to build object category models. Existing unsupervised approaches do not take word sense into consideration. We propose a new method that uses a dictionary to learn models of visual word sense from a large collection of unlabeled web data. The use of LDA to discover a latent sense space makes the model robust despite the very limited nature of dictionary definitions. The definitions are used to learn a distribution in the latent space that best represents a sense. The algorithm then uses the text surrounding image links to retrieve images with high probability of a particular dictionary sense. An object classifier is trained on the resulting sense-specific images. We evaluate our method on a dataset obtained by searching the web for polysemous words. Category classification experiments show that our dictionary-based approach outperforms baseline methods.
Cite
Text
Saenko and Darrell. "Unsupervised Learning of Visual Sense Models for Polysemous Words." Neural Information Processing Systems, 2008.Markdown
[Saenko and Darrell. "Unsupervised Learning of Visual Sense Models for Polysemous Words." Neural Information Processing Systems, 2008.](https://mlanthology.org/neurips/2008/saenko2008neurips-unsupervised/)BibTeX
@inproceedings{saenko2008neurips-unsupervised,
title = {{Unsupervised Learning of Visual Sense Models for Polysemous Words}},
author = {Saenko, Kate and Darrell, Trevor},
booktitle = {Neural Information Processing Systems},
year = {2008},
pages = {1393-1400},
url = {https://mlanthology.org/neurips/2008/saenko2008neurips-unsupervised/}
}