Archipelago: Nonparametric Bayesian Semi-Supervised Learning

Abstract

Semi-supervised learning (SSL), is classification where additional unlabeled data can be used to improve accuracy. Generative approaches are appealing in this situation, as good models of the data's probability density can assist in identifying clusters. Nonparametric Bayesian methods, while ideal in theory due to their principled motivations, have been difficult to apply to SSL in practice. In this work, we present a nonparametric Bayesian method that uses Gaussian processes for the generative model, avoiding many of the problems associated with Dirichlet process mixture models. Our model is fully generative and we take advantage of recent advances in Markov chain Monte Carlo algorithms to provide a practical inference method. Our method compares favorably to competing approaches on synthetic and real-world multi-class data.

Cite

Text

Adams and Ghahramani. "Archipelago: Nonparametric Bayesian Semi-Supervised Learning." International Conference on Machine Learning, 2009. doi:10.1145/1553374.1553375

Markdown

[Adams and Ghahramani. "Archipelago: Nonparametric Bayesian Semi-Supervised Learning." International Conference on Machine Learning, 2009.](https://mlanthology.org/icml/2009/adams2009icml-archipelago/) doi:10.1145/1553374.1553375

BibTeX

@inproceedings{adams2009icml-archipelago,
  title     = {{Archipelago: Nonparametric Bayesian Semi-Supervised Learning}},
  author    = {Adams, Ryan Prescott and Ghahramani, Zoubin},
  booktitle = {International Conference on Machine Learning},
  year      = {2009},
  pages     = {1-8},
  doi       = {10.1145/1553374.1553375},
  url       = {https://mlanthology.org/icml/2009/adams2009icml-archipelago/}
}