Exphormer: Sparse Transformers for Graphs

Abstract

Graph transformers have emerged as a promising architecture for a variety of graph learning and representation tasks. Despite their successes, though, it remains challenging to scale graph transformers to large graphs while maintaining accuracy competitive with message-passing networks. In this paper, we introduce Exphormer, a framework for building powerful and scalable graph transformers. Exphormer consists of a sparse attention mechanism based on two mechanisms: virtual global nodes and expander graphs, whose mathematical characteristics, such as spectral expansion, pseduorandomness, and sparsity, yield graph transformers with complexity only linear in the size of the graph, while allowing us to prove desirable theoretical properties of the resulting transformer models. We show that incorporating Exphormer into the recently-proposed GraphGPS framework produces models with competitive empirical results on a wide variety of graph datasets, including state-of-the-art results on three datasets. We also show that Exphormer can scale to datasets on larger graphs than shown in previous graph transformer architectures.

Cite

Text

Shirzad et al. "Exphormer: Sparse Transformers for Graphs." International Conference on Machine Learning, 2023.

Markdown

[Shirzad et al. "Exphormer: Sparse Transformers for Graphs." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/shirzad2023icml-exphormer/)

BibTeX

@inproceedings{shirzad2023icml-exphormer,
  title     = {{Exphormer: Sparse Transformers for Graphs}},
  author    = {Shirzad, Hamed and Velingker, Ameya and Venkatachalam, Balaji and Sutherland, Danica J. and Sinop, Ali Kemal},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {31613-31632},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/shirzad2023icml-exphormer/}
}