Semi-Supervised Latent Block Model with Pairwise Constraints
Abstract
Co-clustering aims at simultaneously partitioning both dimensions of a data matrix. It has demonstrated better performances than one-sided clustering for high-dimensional data. The Latent Block Model (LBM) is a probabilistic model for co-clustering based on mixture models that has proven useful for a broad class of data. In this paper, we propose to leverage prior knowledge in the form of pairwise semi-supervision in both row and column space to improve the clustering performances of the algorithms derived from this model. We present a general probabilistic framework for incorporating must link and cannot link relationships in the LBM based on Hidden Markov Random Fields. We instantiate this framework on a model for count data and present two inference algorithms based on Variational and Classification EM. Extensive experiments on simulated data and on real-world attributed networks confirm the interest of our approach and demonstrate the effectiveness of our algorithms.
Cite
Text
Riverain et al. "Semi-Supervised Latent Block Model with Pairwise Constraints." Machine Learning, 2022. doi:10.1007/S10994-022-06137-4Markdown
[Riverain et al. "Semi-Supervised Latent Block Model with Pairwise Constraints." Machine Learning, 2022.](https://mlanthology.org/mlj/2022/riverain2022mlj-semisupervised/) doi:10.1007/S10994-022-06137-4BibTeX
@article{riverain2022mlj-semisupervised,
title = {{Semi-Supervised Latent Block Model with Pairwise Constraints}},
author = {Riverain, Paul and Fossier, Simon and Nadif, Mohamed},
journal = {Machine Learning},
year = {2022},
pages = {1739-1764},
doi = {10.1007/S10994-022-06137-4},
volume = {111},
url = {https://mlanthology.org/mlj/2022/riverain2022mlj-semisupervised/}
}