SLADE: Detecting Dynamic Anomalies in Edge Streams without Labels via Self-Supervised Learning
Jongha Lee, Sunwoo Kim, Kijung Shin
TL;DR
SLADE addresses dynamic anomaly detection on continuous-time dynamic graphs without labels by learning stable long-term node interaction patterns and short-term regeneration through two self-supervised tasks. It employs a memory-augmented architecture with a GRU-based memory updater and a TGAT-based memory generator, achieving constant-time per-edge inference while updating node representations incrementally. Empirical results on four real-world datasets show SLADE outperforming nine baselines (including supervised ones) in AUC, with strong ablation results highlighting the value of both the temporal contrast and memory-generation objectives. This approach enables scalable, real-time anomaly detection in evolving edge streams, with broad applicability to social, financial, and communication networks.
Abstract
To detect anomalies in real-world graphs, such as social, email, and financial networks, various approaches have been developed. While they typically assume static input graphs, most real-world graphs grow over time, naturally represented as edge streams. In this context, we aim to achieve three goals: (a) instantly detecting anomalies as they occur, (b) adapting to dynamically changing states, and (c) handling the scarcity of dynamic anomaly labels. In this paper, we propose SLADE (Self-supervised Learning for Anomaly Detection in Edge Streams) for rapid detection of dynamic anomalies in edge streams, without relying on labels. SLADE detects the shifts of nodes into abnormal states by observing deviations in their interaction patterns over time. To this end, it trains a deep neural network to perform two self-supervised tasks: (a) minimizing drift in node representations and (b) generating long-term interaction patterns from short-term ones. Failure in these tasks for a node signals its deviation from the norm. Notably, the neural network and tasks are carefully designed so that all required operations can be performed in constant time (w.r.t. the graph size) in response to each new edge in the input stream. In dynamic anomaly detection across four real-world datasets, SLADE outperforms nine competing methods, even those leveraging label supervision.
