Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck

Fu, Xingcheng; Gao, Yisen; Yang, Beining; Wu, Yuxuan; Qian, Haodong; Sun, Qingyun; Li, Xianxian

doi:10.1609/AAAI.V39I16.33832

Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck

Xingcheng Fu, Yisen Gao, Beining Yang, Yuxuan Wu, Haodong Qian, Qingyun Sun, Xianxian Li

AAAI 2025 pp. 16674-16681

doi:10.1609/AAAI.V39I16.33832 /aaai/2025/fu2025aaai-bi/

Abstract

Dataset condensation has significantly improved model training efficiency, but its application on devices with different computing power brings new requirements for different data sizes. For sparse graph data with non-Euclidean structures, repeated condensation of each scale may lead to significant computational costs. Thus, condensing multiple scale graphs simultaneously is the core of achieving efficient training in different on-device scenarios. Existing efficient works for multi-scale graph dataset condensation mainly perform efficient approximate computation in scale order (large-to-small or small-to-large scales). However, these two commonly used paradigms for multi-scale graph dataset condensation have serious ''scaling down degradation'' and ''scaling up collapse" problems of a graph. The main bottleneck of the above paradigms is whether the effective information of the original graph is fully preserved when consenting to the primary sub-scale (the first of multiple scales), which determines the condensation effect and consistency of all scales. In this paper, we proposed a novel GNN-centric Bi-directional Multi-Scale Graph Dataset Condensation (BiMSGC) framework, to explore unifying paradigms by operating on both large-to-small and small-to-large for multi-scale graph condensation. Based on the mutual information theory, we estimate an optimal ''meso-scale'' to obtain the minimum necessary dense graph preserving the maximum utility information of the original graph, and then we achieve stable and consistent ''bi-directional'' condensation learning by optimizing graph eigenbasis matching with information bottleneck on other scales. Encouraging empirical results on several datasets demonstrates the significant superiority of the proposed framework in graph condensation at different scales.

PDF AAAI Semantic Scholar

Cite

Text

Fu et al. "Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I16.33832

Markdown

[Fu et al. "Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/fu2025aaai-bi/) doi:10.1609/AAAI.V39I16.33832

BibTeX

@inproceedings{fu2025aaai-bi,
  title     = {{Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck}},
  author    = {Fu, Xingcheng and Gao, Yisen and Yang, Beining and Wu, Yuxuan and Qian, Haodong and Sun, Qingyun and Li, Xianxian},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {16674-16681},
  doi       = {10.1609/AAAI.V39I16.33832},
  url       = {https://mlanthology.org/aaai/2025/fu2025aaai-bi/}
}