Semi-Supervised Classification from Discriminative Random Walks
Abstract
This paper describes a novel technique, called $\mathcal{D}$ -walks, to tackle semi-supervised classification problems in large graphs. We introduce here a betweenness measure based on passage times during random walks of bounded lengths. Such walks are further constrained to start and end in nodes within the same class, defining a distinct betweenness for each class. Unlabeled nodes are classified according to the class showing the highest betweenness. Forward and backward recurrences are derived to efficiently compute the passage times. $\mathcal{D}$ -walks can deal with directed or undirected graphs with a linear time complexity with respect to the number of edges, the maximum walk length considered and the number of classes. Experiments on various real-life databases show that $\mathcal{D}$ -walks outperforms NetKit [5], the approach of Zhou and Schölkopf [15] and the regularized laplacian kernel [2]. The benefit of $\mathcal{D}$ -walks is particularly noticeable when few labeled nodes are available. The computation time of $\mathcal{D}$ -walks is also substantially lower in all cases.
Cite
Text
Callut et al. "Semi-Supervised Classification from Discriminative Random Walks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008. doi:10.1007/978-3-540-87479-9_29Markdown
[Callut et al. "Semi-Supervised Classification from Discriminative Random Walks." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008.](https://mlanthology.org/ecmlpkdd/2008/callut2008ecmlpkdd-semisupervised/) doi:10.1007/978-3-540-87479-9_29BibTeX
@inproceedings{callut2008ecmlpkdd-semisupervised,
title = {{Semi-Supervised Classification from Discriminative Random Walks}},
author = {Callut, Jérôme and Françoisse, Kevin and Saerens, Marco and Dupont, Pierre},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2008},
pages = {162-177},
doi = {10.1007/978-3-540-87479-9_29},
url = {https://mlanthology.org/ecmlpkdd/2008/callut2008ecmlpkdd-semisupervised/}
}