Determining the Number of Communities in Degree-Corrected Stochastic Block Models

Abstract

We propose to estimate the number of communities in degree-corrected stochastic block models based on a pseudo likelihood ratio statistic. To this end, we introduce a method that combines spectral clustering with binary segmentation. This approach guarantees an upper bound for the pseudo likelihood ratio statistic when the model is over-fitted. We also derive its limiting distribution when the model is under-fitted. Based on these properties, we establish the consistency of our estimator for the true number of communities. Developing these theoretical properties require a mild condition on the average degrees - growing at a rate no slower than log(n), where n is the number of nodes. Our proposed method is further illustrated by simulation studies and analysis of real-world networks. The numerical results show that our approach has satisfactory performance when the network is semi-dense.

Cite

Text

Ma et al. "Determining the Number of Communities in Degree-Corrected Stochastic Block Models." Journal of Machine Learning Research, 2021.

Markdown

[Ma et al. "Determining the Number of Communities in Degree-Corrected Stochastic Block Models." Journal of Machine Learning Research, 2021.](https://mlanthology.org/jmlr/2021/ma2021jmlr-determining/)

BibTeX

@article{ma2021jmlr-determining,
  title     = {{Determining the Number of Communities in Degree-Corrected Stochastic Block Models}},
  author    = {Ma, Shujie and Su, Liangjun and Zhang, Yichong},
  journal   = {Journal of Machine Learning Research},
  year      = {2021},
  pages     = {1-63},
  volume    = {22},
  url       = {https://mlanthology.org/jmlr/2021/ma2021jmlr-determining/}
}