Learning from Human-Generated Lists

Abstract

Human-generated lists are a form of non-iid data with important applications in machine learning and cognitive psychology. We propose a generative model - sampling with reduced replacement (SWIRL) - for such lists. We discuss SWIRL’s relation to standard sampling paradigms, provide the maximum likelihood estimate for learning, and demonstrate its value with two real-world applications: (i) In a ""feature volunteering"" task where non-experts spontaneously generate feature=>label pairs for text classification, SWIRL improves the accuracy of state-of-the-art feature-learning frameworks. (ii) In a ""verbal fluency"" task where brain-damaged patients generate word lists when prompted with a category, SWIRL parameters align well with existing psychological theories, and our model can classify healthy people vs. patients from the lists they generate.

Cite

Text

Jun et al. "Learning from Human-Generated Lists." International Conference on Machine Learning, 2013.

Markdown

[Jun et al. "Learning from Human-Generated Lists." International Conference on Machine Learning, 2013.](https://mlanthology.org/icml/2013/jun2013icml-learning/)

BibTeX

@inproceedings{jun2013icml-learning,
  title     = {{Learning from Human-Generated Lists}},
  author    = {Jun, Kwang-Sung and Zhu, Jerry and Settles, Burr and Rogers, Timothy},
  booktitle = {International Conference on Machine Learning},
  year      = {2013},
  pages     = {181-189},
  volume    = {28},
  url       = {https://mlanthology.org/icml/2013/jun2013icml-learning/}
}