Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods
Abstract
Graph-based semi-supervised learning is the problem of propagating labels from a small number of labelled data points to a larger set of unlabelled data. This paper is concerned with the consistency of optimization-based techniques for such problems, in the limit where the labels have small noise and the underlying unlabelled data is well clustered. We study graph-based probit for binary classification, and a natural generalization of this method to multi-class classification using one-hot encoding. The resulting objective function to be optimized comprises the sum of a quadratic form defined through a rational function of the graph Laplacian, involving only the unlabelled data, and a fidelity term involving only the labelled data. The consistency analysis sheds light on the choice of the rational function defining the optimization.
Cite
Text
Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.Markdown
[Hoffmann et al. "Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods." Journal of Machine Learning Research, 2020.](https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/)BibTeX
@article{hoffmann2020jmlr-consistency,
title = {{Consistency of Semi-Supervised Learning Algorithms on Graphs: Probit and One-Hot Methods}},
author = {Hoffmann, Franca and Hosseini, Bamdad and Ren, Zhi and Stuart, Andrew M},
journal = {Journal of Machine Learning Research},
year = {2020},
pages = {1-55},
volume = {21},
url = {https://mlanthology.org/jmlr/2020/hoffmann2020jmlr-consistency/}
}