Graph Mixup on Approximate Gromov–Wasserstein Geodesics
Abstract
Mixup, which generates synthetic training samples on the data manifold, has been shown to be highly effective in augmenting Euclidean data. However, finding a proper data manifold for graph data is non-trivial, as graphs are non-Euclidean data in disparate spaces. Though efforts have been made, most of the existing graph mixup methods neglect the intrinsic geodesic guarantee, thereby generating inconsistent sample-label pairs. To address this issue, we propose GeoMix to mixup graphs on the Gromov-Wasserstein (GW) geodesics. A joint space over input graphs is first defined based on the GW distance, and graphs are then transformed into the GW space through equivalence-preserving transformations. We further show that the linear interpolation of the transformed graph pairs defines a geodesic connecting the original pairs on the GW manifold, hence ensuring the consistency between generated samples and labels. An accelerated mixup algorithm on the approximate low-dimensional GW manifold is further proposed. Extensive experiments show that the proposed GeoMix promotes the generalization and robustness of GNN models.
Cite
Text
Zeng et al. "Graph Mixup on Approximate Gromov–Wasserstein Geodesics." International Conference on Machine Learning, 2024.Markdown
[Zeng et al. "Graph Mixup on Approximate Gromov–Wasserstein Geodesics." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/zeng2024icml-graph/)BibTeX
@inproceedings{zeng2024icml-graph,
title = {{Graph Mixup on Approximate Gromov–Wasserstein Geodesics}},
author = {Zeng, Zhichen and Qiu, Ruizhong and Xu, Zhe and Liu, Zhining and Yan, Yuchen and Wei, Tianxin and Ying, Lei and He, Jingrui and Tong, Hanghang},
booktitle = {International Conference on Machine Learning},
year = {2024},
pages = {58387-58406},
volume = {235},
url = {https://mlanthology.org/icml/2024/zeng2024icml-graph/}
}