Towards Anomaly-Aware Pre-Training and Fine-Tuning for Graph Anomaly Detection

Yunhui Liu; Jiashun Cheng; Yiqing Lin; Qizhuo Xie; Jia Li; Fugee Tsung; Hongzhi Yin; Tao Zheng; Jianhua Zhao; Tieke He

Towards Anomaly-Aware Pre-Training and Fine-Tuning for Graph Anomaly Detection

Yunhui Liu, Jiashun Cheng, Yiqing Lin, Qizhuo Xie, Jia Li, Fugee Tsung, Hongzhi Yin, Tao Zheng, Jianhua Zhao, Tieke He

TL;DR

This work tackles graph anomaly detection under severe label scarcity and local homophily disparity by introducing Anomaly-Aware Pre-Training and Fine-Tuning (APF). APF combines Rayleigh Quotient-guided subgraph sampling with dual spectral filters to learn anomaly-relevant representations during pre-training, and a granularity-adaptive fusion mechanism with anomaly-aware regularization during fine-tuning. Theoretical analysis in an anomalous stochastic block model framework supports linear separability under mild conditions, while extensive experiments on 10 GADBench datasets demonstrate strong, robust performance across diverse domains. The approach offers a scalable, interpretable pathway to effective GAD under realistic supervision constraints, with clear mechanisms to adapt to node- and dimension-level heterogeneity.

Abstract

Graph anomaly detection (GAD) has garnered increasing attention in recent years, yet remains challenging due to two key factors: (1) label scarcity stemming from the high cost of annotations and (2) homophily disparity at node and class levels. In this paper, we introduce Anomaly-Aware Pre-Training and Fine-Tuning (APF), a targeted and effective framework to mitigate the above challenges in GAD. In the pre-training stage, APF incorporates node-specific subgraphs selected via the Rayleigh Quotient, a label-free anomaly metric, into the learning objective to enhance anomaly awareness. It further introduces two learnable spectral polynomial filters to jointly learn dual representations that capture both general semantics and subtle anomaly cues. During fine-tuning, a gated fusion mechanism adaptively integrates pre-trained representations across nodes and dimensions, while an anomaly-aware regularization loss encourages abnormal nodes to preserve more anomaly-relevant information. Furthermore, we theoretically show that APF tends to achieve linear separability under mild conditions. Comprehensive experiments on 10 benchmark datasets validate the superior performance of APF in comparison to state-of-the-art baselines.

Towards Anomaly-Aware Pre-Training and Fine-Tuning for Graph Anomaly Detection

TL;DR

Abstract

Towards Anomaly-Aware Pre-Training and Fine-Tuning for Graph Anomaly Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (4)