IGLU: Efficient GCN Training via Lazy Updates

Narayanan, S Deepak; Sinha, Aditya; Jain, Prateek; Kar, Purushottam; Sellamanickam, Sundararajan

ICLR 2022

Abstract

Training multi-layer Graph Convolution Networks (GCN) using standard SGD techniques scales poorly as each descent step ends up updating node embeddings for a large portion of the graph. Recent attempts to remedy this sub-sample the graph, which reduces compute but introduces additional variance and may offer suboptimal performance. This paper develops the IGLU method, which caches intermediate computations at various GCN layers, thus enabling lazy updates that significantly reduce the compute cost of descent. IGLU introduces bounded bias into the gradients but nevertheless converges to a first-order saddle point under standard assumptions such as objective smoothness. Benchmark experiments show that IGLU offers up to 1.2% better accuracy despite requiring up to 88% less compute.
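
The general idea of caching a layer's output so that several descent steps can reuse it may be easier to see in a toy example. The sketch below is not the IGLU algorithm from the paper; it is a minimal NumPy illustration under assumptions of my own: a two-layer GCN with a squared-error loss, a hypothetical helper `normalize_adjacency`, and a hypothetical training loop `lazy_train` that caches the aggregated hidden embeddings once per outer round and then takes several cheap mini-batch steps on the last layer using only cached rows. In this toy the first-layer weights are frozen while the cache is in use, so the gradient bias mentioned in the abstract is not modeled.

```python
import numpy as np

def normalize_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} (self-loops added)."""
    a = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a.sum(axis=1))
    return a * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def lazy_train(a_hat, x, y, rounds=50, inner_steps=5, batch=2,
               hidden=4, lr=0.1, seed=0):
    """Toy two-layer GCN trained with cached embeddings (illustrative only)."""
    rng = np.random.default_rng(seed)
    n, d = x.shape
    w1 = rng.normal(scale=0.5, size=(d, hidden))
    w2 = rng.normal(scale=0.5, size=(hidden, 1))
    ax = a_hat @ x  # first-layer aggregation never changes, so compute it once
    losses = []
    for _ in range(rounds):
        # Refresh: one full-graph pass, then cache the aggregated hidden layer.
        z1 = ax @ w1
        h1 = np.maximum(z1, 0.0)
        cache = a_hat @ h1  # shape (n, hidden), reused by the steps below

        # Lazy steps: each update reads only `batch` cached rows and never
        # touches the neighbors of the sampled nodes.
        for _ in range(inner_steps):
            idx = rng.choice(n, size=batch, replace=False)
            err = cache[idx] @ w2 - y[idx]
            w2 -= lr * cache[idx].T @ err / batch

        # One full-graph gradient step on the first layer per round.
        err = cache @ w2 - y
        grad_h1 = a_hat.T @ (err @ w2.T)
        grad_z1 = grad_h1 * (z1 > 0)
        w1 -= lr * ax.T @ grad_z1 / n

        pred = a_hat @ np.maximum(ax @ w1, 0.0) @ w2
        losses.append(float(np.mean((pred - y) ** 2)))
    return w1, w2, losses

# Hypothetical 4-node path graph with one-hot features and scalar targets.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
x = np.eye(4)
y = np.array([[0.0], [0.0], [1.0], [1.0]])
w1, w2, losses = lazy_train(normalize_adjacency(adj), x, y)
print(losses[0], losses[-1])
```

The design choice to notice is that the inner loop costs time proportional to the batch size rather than to the size of the sampled nodes' multi-hop neighborhoods, which is the kind of saving the abstract attributes to lazy updates.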

Cite

Text

Narayanan et al. "IGLU: Efficient GCN Training via Lazy Updates." International Conference on Learning Representations, 2022.

Markdown

[Narayanan et al. "IGLU: Efficient GCN Training via Lazy Updates." International Conference on Learning Representations, 2022.](https://mlanthology.org/iclr/2022/narayanan2022iclr-iglu/)

BibTeX

@inproceedings{narayanan2022iclr-iglu,
  title     = {{IGLU: Efficient GCN Training via Lazy Updates}},
  author    = {Narayanan, S Deepak and Sinha, Aditya and Jain, Prateek and Kar, Purushottam and Sellamanickam, Sundararajan},
  booktitle = {International Conference on Learning Representations},
  year      = {2022},
  url       = {https://mlanthology.org/iclr/2022/narayanan2022iclr-iglu/}
}