GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs

Abstract

Graph-based patterns are extensively employed and favored by practitioners within industrial companies due to their capacity to represent the behavioral attributes and topological relationships among users, thereby offering enhanced interpretability in comparison to black-box models commonly utilized for classification and recognition tasks. For instance, within the scenario of transaction risk management, a graph pattern that is characteristic of a particular risk category can be readily employed to discern transactions fraught with risk, delineate networks of criminal activity, or investigate the methodologies employed by fraudsters. Nonetheless, graph data in industrial settings is often characterized by its massive scale, encompassing data sets with millions or even billions of nodes, making the manual extraction of graph patterns not only labor-intensive but also necessitating specialized knowledge in particular domains of risk. Moreover, existing methodologies for mining graph patterns encounter significant obstacles when tasked with analyzing large-scale attributed graphs. In this work, we introduce GraphRPM, an industry-purpose parallel and distributed risk pattern mining framework on large attributed graphs. The framework incorporates a novel edge-involved graph isomorphism network alongside optimized operations for parallel graph computation, which collectively contribute to a considerable reduction in computational complexity and resource expenditure. Moreover, the intelligent filtration of efficacious risky graph patterns is facilitated by the proposed evaluation metrics. Comprehensive experimental evaluations conducted on real-world datasets of varying sizes substantiate the capability of GraphRPM to adeptly address the challenges inherent in mining patterns from large-scale industrial attributed graphs, thereby underscoring its substantial value for industrial deployment.

Cite

Text

Tian et al. "GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70381-2_9

Markdown

[Tian et al. "GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/tian2024ecmlpkdd-graphrpm/) doi:10.1007/978-3-031-70381-2_9

BibTeX

@inproceedings{tian2024ecmlpkdd-graphrpm,
  title     = {{GraphRPM: Risk Pattern Mining on Industrial Large Attributed Graphs}},
  author    = {Tian, Sheng and Zeng, Xintan and Hu, Yifei and Wang, Baokun and Liu, Yongchao and Jin, Yue and Meng, Changhua and Hong, Chuntao and Zhang, Tianyi and Wang, Weiqiang},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {133-149},
  doi       = {10.1007/978-3-031-70381-2_9},
  url       = {https://mlanthology.org/ecmlpkdd/2024/tian2024ecmlpkdd-graphrpm/}
}