Predicting Human Similarity Judgments Using Large Language Models

Abstract

Similarity judgments provide a well-established method for accessing mental representations, with applications in psychology, neuroscience and machine learning. However, collecting similarity judgments can be prohibitively expensive for naturalistic datasets as the number of comparisons grows quadratically in the number of stimuli. We leverage recent advances in language models and online recruitment, proposing an efficient domain-general procedure for predicting human similarity judgments based on text descriptions. Crucially, the number of descriptions required grows only linearly with the number of stimuli, drastically reducing the amount of data required. We test this procedure on six datasets of naturalistic images and show that our models outperform previous approaches based on visual information.

Cite

Text

Marjieh et al. "Predicting Human Similarity Judgments Using Large Language Models." ICML 2022 Workshops: Pre-Training, 2022.

Markdown

[Marjieh et al. "Predicting Human Similarity Judgments Using Large Language Models." ICML 2022 Workshops: Pre-Training, 2022.](https://mlanthology.org/icmlw/2022/marjieh2022icmlw-predicting/)

BibTeX

@inproceedings{marjieh2022icmlw-predicting,
  title     = {{Predicting Human Similarity Judgments Using Large Language Models}},
  author    = {Marjieh, Raja and Sucholutsky, Ilia and Sumers, Theodore and Jacoby, Nori and Griffiths, Thomas L.},
  booktitle = {ICML 2022 Workshops: Pre-Training},
  year      = {2022},
  url       = {https://mlanthology.org/icmlw/2022/marjieh2022icmlw-predicting/}
}