Tree Mover's Distance: Bridging Graph Metrics and Stability of Graph Neural Networks
Abstract
Understanding generalization and robustness of machine learning models fundamentally relies on assuming an appropriate metric on the data space. Identifying such a metric is particularly challenging for non-Euclidean data such as graphs. Here, we propose a pseudometric for attributed graphs, the Tree Mover's Distance (TMD), and study its relation to generalization. Via a hierarchical optimal transport problem, TMD reflects the local distribution of node attributes as well as the distribution of local computation trees, which are known to be decisive for the learning behavior of graph neural networks (GNNs). First, we show that TMD captures properties relevant for graph classification: a simple TMD-SVM can perform competitively with standard GNNs. Second, we relate TMD to generalization of GNNs under distribution shifts, and show that it correlates well with performance drop under such shifts.
Cite
Text
Chuang and Jegelka. "Tree Mover's Distance: Bridging Graph Metrics and Stability of Graph Neural Networks." Neural Information Processing Systems, 2022.Markdown
[Chuang and Jegelka. "Tree Mover's Distance: Bridging Graph Metrics and Stability of Graph Neural Networks." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/chuang2022neurips-tree/)BibTeX
@inproceedings{chuang2022neurips-tree,
title = {{Tree Mover's Distance: Bridging Graph Metrics and Stability of Graph Neural Networks}},
author = {Chuang, Ching-Yao and Jegelka, Stefanie},
booktitle = {Neural Information Processing Systems},
year = {2022},
url = {https://mlanthology.org/neurips/2022/chuang2022neurips-tree/}
}