TRACI: A Data-centric Approach for Multi-Domain Generalization on Graphs

Authors

  • Yusheng Zhao State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
  • Changhu Wang Department of Statistics, University of California, Los Angeles, CA, USA
  • Xiao Luo Department of Computer Science, University of California, Los Angeles, CA, USA
  • Junyu Luo State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
  • Wei Ju State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China
  • Zhiping Xiao Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
  • Ming Zhang State Key Laboratory for Multimedia Information Processing, School of Computer Science, PKU-Anker LLM Lab, Peking University, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v39i12.33463

Abstract

Graph neural networks (GNNs) have gained superior performance in graph-based prediction tasks with a variety of applications such as social analysis and drug discovery. Despite the remarkable progress, their performance often degrades on test graphs with distribution shifts. Existing domain adaptation methods rely on unlabeled test graphs during optimization, limiting their applicability to graphs in the wild. Towards this end, this paper studies the problem of multi-domain generalization on graphs, which utilizes multiple source graphs to learn a GNN with high performance on unseen target graphs. We propose a new approach named Topological Adversarial Learning with Prototypical Mixup (TRACI) to solve the problem. The fundamental principle behind our TRACI is to produce virtual adversarial and mixed graph samples from a data-centric view. In particular, TRACI enhances GNN generalization by employing a gradient-ascent strategy that considers both label prediction entropy and graph topology to craft challenging adversarial samples. Additionally, it generates domain-agnostic node representations by characterizing class-graph pair prototypes through latent distributions and applying multi-sample prototypical Mixup for distribution alignment across graphs. We further provide theoretical analysis showing that TRACI reduces the model's excess risk. Extensive experiments on various benchmark datasets demonstrate that TRACI outperforms state-of-the-art baselines, validating its effectiveness.

Downloads

Published

2025-04-11

How to Cite

Zhao, Y., Wang, C., Luo, X., Luo, J., Ju, W., Xiao, Z., & Zhang, M. (2025). TRACI: A Data-centric Approach for Multi-Domain Generalization on Graphs. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 13401–13409. https://doi.org/10.1609/aaai.v39i12.33463

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II