Table of Contents
Fetching ...

Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective

Di Jin, Jingyi Cao, Xiaobao Wang, Bingdao Feng, Dongxiao He, Longbiao Wang, Jianwu Dang

TL;DR

This work addresses Graph Anomaly Detection with contrastive learning, identifying interfering edges as a key source of noise that degrades training. It introduces CVGAD, which combines multi-scale anomaly awareness (node-subgraph and node-node contrasts on anomalous and clean views) with a progressive purification mechanism that iteratively removes high-interference edges based on edge scores derived from node-level signals. The approach achieves stronger ROC-AUC performance than seven baselines across five datasets, supported by thorough ablations and parameter analyses. By jointly mitigating interference and leveraging dual-view contrast, CVGAD offers a robust framework for reliable graph anomaly detection with practical implications for security and fraud detection.

Abstract

Graph anomaly detection aims to identify unusual patterns in graph-based data, with wide applications in fields such as web security and financial fraud detection. Existing methods typically rely on contrastive learning, assuming that a lower similarity between a node and its local subgraph indicates abnormality. However, these approaches overlook a crucial limitation: the presence of interfering edges invalidates this assumption, since it introduces disruptive noise that compromises the contrastive learning process. Consequently, this limitation impairs the ability to effectively learn meaningful representations of normal patterns, leading to suboptimal detection performance. To address this issue, we propose a Clean-View Enhanced Graph Anomaly Detection framework (CVGAD), which includes a multi-scale anomaly awareness module to identify key sources of interference in the contrastive learning process. Moreover, to mitigate bias from the one-step edge removal process, we introduce a novel progressive purification module. This module incrementally refines the graph by iteratively identifying and removing interfering edges, thereby enhancing model performance. Extensive experiments on five benchmark datasets validate the effectiveness of our approach.

Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective

TL;DR

This work addresses Graph Anomaly Detection with contrastive learning, identifying interfering edges as a key source of noise that degrades training. It introduces CVGAD, which combines multi-scale anomaly awareness (node-subgraph and node-node contrasts on anomalous and clean views) with a progressive purification mechanism that iteratively removes high-interference edges based on edge scores derived from node-level signals. The approach achieves stronger ROC-AUC performance than seven baselines across five datasets, supported by thorough ablations and parameter analyses. By jointly mitigating interference and leveraging dual-view contrast, CVGAD offers a robust framework for reliable graph anomaly detection with practical implications for security and fraud detection.

Abstract

Graph anomaly detection aims to identify unusual patterns in graph-based data, with wide applications in fields such as web security and financial fraud detection. Existing methods typically rely on contrastive learning, assuming that a lower similarity between a node and its local subgraph indicates abnormality. However, these approaches overlook a crucial limitation: the presence of interfering edges invalidates this assumption, since it introduces disruptive noise that compromises the contrastive learning process. Consequently, this limitation impairs the ability to effectively learn meaningful representations of normal patterns, leading to suboptimal detection performance. To address this issue, we propose a Clean-View Enhanced Graph Anomaly Detection framework (CVGAD), which includes a multi-scale anomaly awareness module to identify key sources of interference in the contrastive learning process. Moreover, to mitigate bias from the one-step edge removal process, we introduce a novel progressive purification module. This module incrementally refines the graph by iteratively identifying and removing interfering edges, thereby enhancing model performance. Extensive experiments on five benchmark datasets validate the effectiveness of our approach.

Paper Structure

This paper contains 17 sections, 15 equations, 4 figures, 2 tables.

Figures (4)

  • Figure 1: (a) A toy example illustrating GAD problems based on contrastive learning. In the whole graph, the red dashed line represents an anomalous edge—a connection that did not originally exist between nodes. The red node signifies an anomalous node, whose features are altered. Both of these anomalies introduce interfering edges (shown as red solid lines), which disrupt the process of generating positive instance pairs. The middle subgraph is sampled through RWR starting from node 3, after which the subgraph embedding for node 3 is obtained through further processing; (b) The proportion of interfering edges under different similarity scores based on raw features on the Cora dataset; (c) The proportion of interfering edges under different similarity scores based on GCN-aggregated features on the Cora dataset.
  • Figure 2: Overview of CVGAD. It consists of three primary components: (1) Multi-scale anomaly awareness: NS contrast and NN contrast are conducted on both the anomalous graph and clean graph, jointly training the model; (2) Progressive purification: Node contrast scores are calculated on the anomalous graph. Based on these scores, an interference-sensitive edge detection matrix is generated. The edges with the top-K highest interference scores are removed to create a cleaner graph. This module, together with the multi-scale anomaly awareness component, is trained iteratively to incrementally improve model performance; (3) Score calculation: The contrast score and the detection score are combined to generate the final node anomaly value.
  • Figure 3: Sensibility analysis of balance parameters.
  • Figure 4: Proportion of interfering edges removed.