Communication-Optimal Distributed Dynamic Graph Clustering

Abstract

We consider the problem of clustering graph nodes over large-scale dynamic graphs, such as citation networks, images and web networks, when graph updates such as node/edge insertions/deletions are observed distributively. We propose communication-efficient algorithms for two well-established communication models namely the message passing and the blackboard models. Given a graph with n nodes that is observed at s remote sites over time [1, t], the two proposed algorithms have communication costs Õ(ns) and Õ(n + s) (Õ hides a polylogarithmic factor), almost matching their lower bounds, Ω(ns) and Ω (n + s), respectively, in the message passing and the blackboard models. More importantly, we prove that at each time point in [1, t] our algorithms generate clustering quality nearly as good as that of centralizing all updates up to that time and then applying a standard centralized clustering algorithm. We conducted extensive experiments on both synthetic and real-life datasets which confirmed the communication efficiency of our approach over baseline algorithms while achieving comparable clustering results.

Cite

Text

Zhu et al. "Communication-Optimal Distributed Dynamic Graph Clustering." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33015957

Markdown

[Zhu et al. "Communication-Optimal Distributed Dynamic Graph Clustering." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/zhu2019aaai-communication/) doi:10.1609/AAAI.V33I01.33015957

BibTeX

@inproceedings{zhu2019aaai-communication,
  title     = {{Communication-Optimal Distributed Dynamic Graph Clustering}},
  author    = {Zhu, Chun Jiang and Zhu, Tan and Lam, Kam-yiu and Han, Song and Bi, Jinbo},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {5957-5964},
  doi       = {10.1609/AAAI.V33I01.33015957},
  url       = {https://mlanthology.org/aaai/2019/zhu2019aaai-communication/}
}