Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods

Abstract

Graph-based semi-supervised learning is the problem of propagating labels from a small number of labelled data points to a larger set of unlabelled data. This paper is concerned with the consistency of optimization-based techniques for such problems, in the limit where the labels have small noise and the underlying unlabelled data is well clustered. We study graph-based probit for binary classification, and a natural generalization of this method to multi-class classification using one-hot encoding. The resulting objective function to be optimized comprises the sum of a quadratic form defined through a rational function of the graph Laplacian, involving only the unlabelled data, and a fidelity term involving only the labelled data. The consistency analysis sheds light on the choice of the rational function defining the optimization.

Cite

Text

Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.

Markdown

[Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.](https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/)

BibTeX

@article{hoffmann2020jmlr-consistency,
  title     = {{Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods}},
  author    = {Hoffmann, Franca and Hosseini, Bamdad and Ren, Zhi and Stuart, Andrew M},
  journal   = {Journal of Machine Learning Research},
  year      = {2020},
  pages     = {1-55},
  volume    = {21},
  url       = {https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/}
}