Data Augmentation for Graph Neural Networks


  • Tong Zhao University of Notre Dame
  • Yozen Liu Snap Inc
  • Leonardo Neves Snap Inc.
  • Oliver Woodford Snap Inc
  • Meng Jiang University of Notre Dame
  • Neil Shah Snap Inc.



Graph-based Machine Learning, Semi-Supervised Learning


Data augmentation has been widely used to improve generalizability of machine learning models. However, comparatively little work studies data augmentation for graphs. This is largely due to the complex, non-Euclidean structure of graphs, which limits possible manipulation operations. Augmentation operations commonly used in vision and language have no analogs for graphs. Our work studies graph data augmentation for graph neural networks (GNNs) in the context of improving semi-supervised node-classification. We discuss practical and theoretical motivations, considerations and strategies for graph data augmentation. Our work shows that neural edge predictors can effectively encode class-homophilic structure to promote intra-class edges and demote inter-class edges in given graph structure, and our main contribution introduces the GAug graph data augmentation framework, which leverages these insights to improve performance in GNN-based node classification via edge prediction. Extensive experiments on multiple benchmarks show that augmentation via GAug improves performance across GNN architectures and datasets.




How to Cite

Zhao, T., Liu, Y., Neves, L., Woodford, O., Jiang, M., & Shah, N. (2021). Data Augmentation for Graph Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 11015-11023.



AAAI Technical Track on Machine Learning V