TRACI: A Data-centric Approach for Multi-Domain Generalization on Graphs

Yusheng Zhao; Changhu Wang; Xiao Luo; Junyu Luo; Wei Ju; Zhiping Xiao; Ming Zhang

doi:10.1609/aaai.v39i12.33463

Authors

Yusheng Zhao State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
Changhu Wang Department of Statistics, University of California, Los Angeles, CA, USA
Xiao Luo Department of Computer Science, University of California, Los Angeles, CA, USA
Junyu Luo State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
Wei Ju State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
Zhiping Xiao Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
Ming Zhang State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v39i12.33463

Abstract

Graph neural networks (GNNs) have gained superior performance in graph-based prediction tasks with a variety of applications such as social analysis and drug discovery. Despite the remarkable progress, their performance often degrades on test graphs with distribution shifts. Existing domain adaptation methods rely on unlabeled test graphs during optimization, limiting their applicability to graphs in the wild. Towards this end, this paper studies the problem of multi-domain generalization on graphs, which utilizes multiple source graphs to learn a GNN with high performance on unseen target graphs. We propose a new approach named Topological Adversarial Learning with Prototypical Mixup (TRACI) to solve the problem. The fundamental principle behind our TRACI is to produce virtual adversarial and mixed graph samples from a data-centric view. In particular, TRACI enhances GNN generalization by employing a gradient-ascent strategy that considers both label prediction entropy and graph topology to craft challenging adversarial samples. Additionally, it generates domain-agnostic node representations by characterizing class-graph pair prototypes through latent distributions and applying multi-sample prototypical Mixup for distribution alignment across graphs. We further provide theoretical analysis showing that TRACI reduces the model's excess risk. Extensive experiments on various benchmark datasets demonstrate that TRACI outperforms state-of-the-art baselines, validating its effectiveness.

TRACI: A Data-centric Approach for Multi-Domain Generalization on Graphs

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information