OASIS: Online Active Semi-Supervised Learning

Abstract

We consider a learning setting of importance to large scale machine learning: potentially unlimited data arrives sequentially, but only a small fraction of it is labeled. The learner cannot store the data; it should learn from both labeled and unlabeled data, and it may also request labels for some of the unlabeled items. This setting is frequently encountered in real-world applications and has the characteristics of online, semi-supervised, and active learning. Yet previous learning models fail to consider these characteristics jointly. We present OASIS, a Bayesian model for this learning setting. The main contributions of the model include the novel integration of a semi-supervised likelihood function, a sequential Monte Carlo scheme for efficient online Bayesian updating, and a posterior-reduction criterion for active learning. Encouraging results on both synthetic and real-world optical character recognition data demonstrate the synergy of these characteristics in OASIS.

Cite

Text

Goldberg et al. "OASIS: Online Active Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2011. doi:10.1609/AAAI.V25I1.7910

Markdown

[Goldberg et al. "OASIS: Online Active Semi-Supervised Learning." AAAI Conference on Artificial Intelligence, 2011.](https://mlanthology.org/aaai/2011/goldberg2011aaai-oasis/) doi:10.1609/AAAI.V25I1.7910

BibTeX

@inproceedings{goldberg2011aaai-oasis,
  title     = {{OASIS: Online Active Semi-Supervised Learning}},
  author    = {Goldberg, Andrew B. and Zhu, Xiaojin and Furger, Alex and Xu, Jun-Ming},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2011},
  pages     = {362-367},
  doi       = {10.1609/AAAI.V25I1.7910},
  url       = {https://mlanthology.org/aaai/2011/goldberg2011aaai-oasis/}
}