Unsupervised Learning of Visual Sense Models for Polysemous Words

Abstract

Polysemy is a problem for methods that exploit image search engines to build object category models. Existing unsupervised approaches do not take word sense into consideration. We propose a new method that uses a dictionary to learn models of visual word sense from a large collection of unlabeled web data. The use of LDA to discover a latent sense space makes the model robust despite the very limited nature of dictionary definitions. The definitions are used to learn a distribution in the latent space that best represents a sense. The algorithm then uses the text surrounding image links to retrieve images with high probability of a particular dictionary sense. An object classifier is trained on the resulting sense-specific images. We evaluate our method on a dataset obtained by searching the web for polysemous words. Category classification experiments show that our dictionary-based approach outperforms baseline methods.

Cite

Text

Saenko and Darrell. "Unsupervised Learning of Visual Sense Models for Polysemous Words." Neural Information Processing Systems, 2008.

Markdown

[Saenko and Darrell. "Unsupervised Learning of Visual Sense Models for Polysemous Words." Neural Information Processing Systems, 2008.](https://mlanthology.org/neurips/2008/saenko2008neurips-unsupervised/)

BibTeX

@inproceedings{saenko2008neurips-unsupervised,
  title     = {{Unsupervised Learning of Visual Sense Models for Polysemous Words}},
  author    = {Saenko, Kate and Darrell, Trevor},
  booktitle = {Neural Information Processing Systems},
  year      = {2008},
  pages     = {1393-1400},
  url       = {https://mlanthology.org/neurips/2008/saenko2008neurips-unsupervised/}
}