Semi-Supervised Latent Block Model with Pairwise Constraints

Abstract

Co-clustering aims at simultaneously partitioning both dimensions of a data matrix. It has demonstrated better performances than one-sided clustering for high-dimensional data. The Latent Block Model (LBM) is a probabilistic model for co-clustering based on mixture models that has proven useful for a broad class of data. In this paper, we propose to leverage prior knowledge in the form of pairwise semi-supervision in both row and column space to improve the clustering performances of the algorithms derived from this model. We present a general probabilistic framework for incorporating must link and cannot link relationships in the LBM based on Hidden Markov Random Fields. We instantiate this framework on a model for count data and present two inference algorithms based on Variational and Classification EM. Extensive experiments on simulated data and on real-world attributed networks confirm the interest of our approach and demonstrate the effectiveness of our algorithms.

Cite

Text

Riverain et al. "Semi-Supervised Latent Block Model with Pairwise Constraints." Machine Learning, 2022. doi:10.1007/S10994-022-06137-4

Markdown

[Riverain et al. "Semi-Supervised Latent Block Model with Pairwise Constraints." Machine Learning, 2022.](https://mlanthology.org/mlj/2022/riverain2022mlj-semisupervised/) doi:10.1007/S10994-022-06137-4

BibTeX

@article{riverain2022mlj-semisupervised,
  title     = {{Semi-Supervised Latent Block Model with Pairwise Constraints}},
  author    = {Riverain, Paul and Fossier, Simon and Nadif, Mohamed},
  journal   = {Machine Learning},
  year      = {2022},
  pages     = {1739-1764},
  doi       = {10.1007/S10994-022-06137-4},
  volume    = {111},
  url       = {https://mlanthology.org/mlj/2022/riverain2022mlj-semisupervised/}
}