Network Tight Community Detection

Abstract

Conventional community detection methods often categorize all nodes into clusters. However, the presumed community structure of interest may only be valid for a subset of nodes (named as ‘tight nodes’), while the rest of the network may consist of noninformative “scattered nodes”. For example, a protein-protein network often contains proteins that do not belong to specific biological functional modules but are involved in more general processes, or act as bridges between different functional modules. Forcing each of these proteins into a single cluster introduces unwanted biases and obscures the underlying biological implication. To address this issue, we propose a tight community detection (TCD) method to identify tight communities excluding scattered nodes. The algorithm enjoys a strong theoretical guarantee of tight node identification accuracy and is scalable for large networks. The superiority of the proposed method is demonstrated by various synthetic and real experiments.

Cite

Text

Deng et al. "Network Tight Community Detection." International Conference on Machine Learning, 2024.

Markdown

[Deng et al. "Network Tight Community Detection." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/deng2024icml-network/)

BibTeX

@inproceedings{deng2024icml-network,
  title     = {{Network Tight Community Detection}},
  author    = {Deng, Jiayi and Yang, Xiaodong and Yu, Jun and Liu, Jun and Shen, Zhaiming and Huang, Danyang and Cheng, Huimin},
  booktitle = {International Conference on Machine Learning},
  year      = {2024},
  pages     = {10574-10596},
  volume    = {235},
  url       = {https://mlanthology.org/icml/2024/deng2024icml-network/}
}