Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods

Franca Hoffmann, Bamdad Hosseini, Zhi Ren, Andrew M Stuart

JMLR 2020 pp. 1-55

/jmlr/2020/hoffmann2020jmlr-consistency/

Abstract

Graph-based semi-supervised learning is the problem of propagating labels from a small number of labelled data points to a larger set of unlabelled data. This paper is concerned with the consistency of optimization-based techniques for such problems, in the limit where the labels have small noise and the underlying unlabelled data is well clustered. We study graph-based probit for binary classification, and a natural generalization of this method to multi-class classification using one-hot encoding. The resulting objective function to be optimized comprises the sum of a quadratic form defined through a rational function of the graph Laplacian, involving only the unlabelled data, and a fidelity term involving only the labelled data. The consistency analysis sheds light on the choice of the rational function defining the optimization.

PDF JMLR Semantic Scholar

Cite

Text

Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.

Markdown

[Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.](https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/)

BibTeX

@article{hoffmann2020jmlr-consistency,
  title     = {{Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods}},
  author    = {Hoffmann, Franca and Hosseini, Bamdad and Ren, Zhi and Stuart, Andrew M},
  journal   = {Journal of Machine Learning Research},
  year      = {2020},
  pages     = {1-55},
  volume    = {21},
  url       = {https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/}
}