FASTRAIN-GNN: Fast and Accurate Self-Training for Graph Neural Networks

Nagarajan, Amrit; Raghunathan, Anand

TMLR 2023

Abstract

Few-shot learning with Graph Neural Networks (GNNs) is an important challenge in expanding the remarkable success that GNNs have achieved. In the transductive node classification scenario, conventional supervised training methods for GNNs fail when only a few labeled nodes are available. Self-training, wherein the GNN is trained in stages by augmenting the training data with a subset of the unlabeled data and the predictions of the GNN on this data (pseudolabels), has emerged as a promising approach to few-shot transductive learning. However, multi-stage self-training significantly increases the computational demands of GNN training. In addition, while the training set evolves considerably across the stages of self-training, the GNN architecture, graph topology, and training hyperparameters are kept constant, which adversely affects both the accuracy of the resulting model and the computational efficiency of training. To address these challenges, we propose FASTRAIN-GNN, a framework for efficient and accurate self-training of GNNs with few labeled nodes. FASTRAIN-GNN performs four main optimizations in each stage of self-training: (1) Sampling-based Pseudolabel Filtering removes nodes whose pseudolabels are likely to be incorrect from the enlarged training set; (2, 3) Dynamic Sizing and Dynamic Regularization find the optimal network architecture and amount of training regularization in each stage of self-training, respectively; and (4) Progressive Graph Pruning removes selected edges between nodes in the training set to reduce the impact of over-smoothing. On few-shot node classification tasks using different GNN architectures, FASTRAIN-GNN produces models that are consistently more accurate (by up to 4.4%), while also substantially reducing the self-training time (by up to 2.1×) over the current state-of-the-art methods. Code is available at https://github.com/amrnag/FASTRAIN-GNN.
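
The generic multi-stage self-training loop that the abstract builds on can be sketched as follows. This is a minimal illustration, not the FASTRAIN-GNN implementation: the names `select_pseudolabels`, `self_train`, and `fit_predict`, as well as the fixed confidence threshold, are assumptions introduced here, and none of the four optimizations (pseudolabel filtering by sampling, dynamic sizing, dynamic regularization, graph pruning) is modeled.

```python
from typing import Callable, Dict, List, Sequence

Probs = Sequence[Sequence[float]]  # probs[node][cls], one row per node


def select_pseudolabels(
    probs: Probs, labeled: Dict[int, int], threshold: float = 0.9
) -> Dict[int, int]:
    """Pick unlabeled nodes whose top predicted probability meets the threshold.

    Simple confidence thresholding is an assumption of this sketch; it stands
    in for whatever selection rule a real self-training method uses.
    """
    selected: Dict[int, int] = {}
    for node, row in enumerate(probs):
        if node in labeled:
            continue  # already in the training set
        best = max(range(len(row)), key=row.__getitem__)
        if row[best] >= threshold:
            selected[node] = best
    return selected


def self_train(
    fit_predict: Callable[[Dict[int, int]], Probs],
    labels: Dict[int, int],
    num_stages: int = 3,
    threshold: float = 0.9,
) -> Dict[int, int]:
    """Run staged self-training and return the final (pseudo)labeled set.

    `fit_predict` is a hypothetical callback: it trains a model on the given
    node -> label mapping and returns class probabilities for every node.
    """
    train = dict(labels)
    for _ in range(num_stages):
        probs = fit_predict(train)  # train on the current set, predict all nodes
        new = select_pseudolabels(probs, train, threshold)
        if not new:
            break  # no confident predictions left to add
        train.update(new)  # enlarge the training set with pseudolabels
    return train


# Toy stand-in for a trained GNN: always returns the same probabilities.
FIXED_PROBS: List[List[float]] = [
    [0.95, 0.05],
    [0.60, 0.40],
    [0.10, 0.90],
    [0.02, 0.98],
]


def toy_fit_predict(train: Dict[int, int]) -> Probs:
    return FIXED_PROBS
```

Calling `self_train(toy_fit_predict, {0: 0})` starts from one labeled node and adds nodes 2 and 3 in the first stage; node 1 never reaches the threshold, so the loop stops in the second stage. In a real setting, `fit_predict` would retrain the GNN in every stage, which is the repeated cost the abstract identifies.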

Cite

Text

Nagarajan and Raghunathan. "FASTRAIN-GNN: Fast and Accurate Self-Training for Graph Neural Networks." Transactions on Machine Learning Research, 2023.

Markdown

[Nagarajan and Raghunathan. "FASTRAIN-GNN: Fast and Accurate Self-Training for Graph Neural Networks." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/nagarajan2023tmlr-fastraingnn/)

BibTeX

@article{nagarajan2023tmlr-fastraingnn,
  title     = {{FASTRAIN-GNN: Fast and Accurate Self-Training for Graph Neural Networks}},
  author    = {Nagarajan, Amrit and Raghunathan, Anand},
  journal   = {Transactions on Machine Learning Research},
  year      = {2023},
  url       = {https://mlanthology.org/tmlr/2023/nagarajan2023tmlr-fastraingnn/}
}