Randomized Spectral Co-Clustering for Large-Scale Directed Networks

Abstract

Directed networks are broadly used to represent asymmetric relationships among units. Co-clustering aims to cluster the senders and receivers of directed networks simultaneously. In particular, the well-known spectral clustering algorithm could be modified as the spectral co-clustering to co-cluster directed networks. However, large-scale networks pose great computational challenges to it. In this paper, we leverage sketching techniques and derive two randomized spectral co-clustering algorithms, one random-projection-based and the other random-sampling-based, to accelerate the co-clustering of large-scale directed networks. We theoretically analyze the resulting algorithms under two generative models – the stochastic co-block model and the degree-corrected stochastic co-block model, and establish their approximation error rates and misclustering error rates, indicating better bounds than the state-of-the-art results of co-clustering literature. Numerically, we design and conduct simulations to support our theoretical results and test the efficiency of the algorithms on real networks with up to millions of nodes. A publicly available R package RandClust is developed for better usability and reproducibility of the proposed methods.

Cite

Text

Guo et al. "Randomized Spectral Co-Clustering for Large-Scale Directed Networks." Journal of Machine Learning Research, 2023.

Markdown

[Guo et al. "Randomized Spectral Co-Clustering for Large-Scale Directed Networks." Journal of Machine Learning Research, 2023.](https://mlanthology.org/jmlr/2023/guo2023jmlr-randomized/)

BibTeX

@article{guo2023jmlr-randomized,
  title     = {{Randomized Spectral Co-Clustering for Large-Scale Directed Networks}},
  author    = {Guo, Xiao and Qiu, Yixuan and Zhang, Hai and Chang, Xiangyu},
  journal   = {Journal of Machine Learning Research},
  year      = {2023},
  pages     = {1-68},
  volume    = {24},
  url       = {https://mlanthology.org/jmlr/2023/guo2023jmlr-randomized/}
}