Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models

Jiayi Luo; Qingyun Sun; Lingjuan Lyu; Ziwei Zhang; Haonan Yuan; Xingcheng Fu; Jianxin Li

TL;DR

Graph Foundation Models (GFMs) enable broad transfer across tasks, but their pre-training stage exposes them to backdoor injection. The paper presents GFM-BA, a backdoor attack built from three modules: label-free trigger association via prototype embeddings, a node-adaptive trigger generator, and persistent anchoring of the backdoor to fine-tuning-insensitive parameters. Together these enable targeted manipulation that survives downstream adaptation. Across five datasets and three victim GFMs, GFM-BA outperforms baselines in attack effectiveness in both targeted and non-targeted settings, while maintaining clean performance, resisting purification, and persisting under fine-tuning. This work exposes a critical security vulnerability in GFMs and motivates the development of defenses for pre-trained graph models and their downstream deployments.

Abstract

Graph Foundation Models (GFMs) are pre-trained on diverse source domains and adapted to unseen targets, enabling broad generalization for graph machine learning. Although GFMs have attracted considerable attention recently, their vulnerability to backdoor attacks remains largely underexplored. A compromised GFM can introduce backdoor behaviors into downstream applications, posing serious security risks. However, launching backdoor attacks against GFMs is non-trivial due to three key challenges. (1) Effectiveness: Attackers lack knowledge of the downstream task during pre-training, making it hard to ensure that triggers reliably induce misclassification into the desired classes. (2) Stealthiness: Node features vary widely across domains, making it difficult to insert triggers that remain stealthy. (3) Persistence: Downstream fine-tuning may erase backdoor behaviors by updating model parameters. To address these challenges, we propose GFM-BA, a novel Backdoor Attack model against Graph Foundation Models. Specifically, we first design a label-free trigger association module that links the trigger to a set of prototype embeddings, eliminating the need for knowledge of downstream tasks when injecting the backdoor. Then, we introduce a node-adaptive trigger generator that dynamically produces node-specific triggers, reducing the risk of trigger detection while still reliably activating the backdoor. Lastly, we develop a persistent backdoor anchoring module that anchors the backdoor to fine-tuning-insensitive parameters, enhancing its persistence under downstream adaptation. Extensive experiments demonstrate the effectiveness, stealthiness, and persistence of GFM-BA.
