Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models

Authors

  • Jiayi Luo Beihang University
  • Qingyun Sun Beihang University
  • Lingjuan Lyu Sony
  • Ziwei Zhang Beihang University
  • Haonan Yuan Beihang University
  • Xingcheng Fu Guangxi Normal University
  • Jianxin Li Beihang University

DOI:

https://doi.org/10.1609/aaai.v40i29.39593

Abstract

Graph Foundation Models (GFMs) are pre-trained on diverse source domains and adapted to unseen targets, enabling broad generalization for graph machine learning. Despite that GFMs have attracted considerable attention recently, their vulnerability to backdoor attacks remains largely underexplored. A compromised GFM can introduce backdoor behaviors into downstream applications, posing serious security risks. However, launching backdoor attacks against GFMs is non-trivial due to three key challenges. (1) Effectiveness: Attackers lack knowledge of the downstream task during pre-training, complicating the assurance that triggers reliably induce misclassifications into desired classes. (2) Stealthiness: The variability in node features across domains complicates trigger insertion that remains stealthy. (3) Persistence: Downstream fine-tuning may erase backdoor behaviors by updating model parameters. To address these challenges, we propose GFM-BA, a novel Backdoor Attack model against Graph Foundation Models. Specifically, we first design a label-free trigger association module that links the trigger to a set of prototype embeddings, eliminating the need for knowledge about downstream tasks to perform backdoor injection. Then, we introduce a node-adaptive trigger generator, dynamically producing node-specific triggers, reducing the risk of trigger detection while reliably activating the backdoor. Lastly, we develop a persistent backdoor anchoring module that firmly anchors the backdoor to fine-tuning-insensitive parameters, enhancing the persistence of the backdoor under downstream adaptation. Extensive experiments demonstrate the effectiveness, stealthiness, and persistence of GFM-BA.

Downloads

Published

2026-03-14

How to Cite

Luo, J., Sun, Q., Lyu, L., Zhang, Z., Yuan, H., Fu, X., & Li, J. (2026). Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(29), 24142–24150. https://doi.org/10.1609/aaai.v40i29.39593

Issue

Section

AAAI Technical Track on Machine Learning VI