Adversarial Spatio-Temporal Attention Networks for Epileptic Seizure Forecasting
Zan Li, Kyongmin Yeo, Wesley Gifford, Lara Marcuse, Madeline Fields, Bülent Yener
TL;DR
This work tackles epileptic seizure forecasting from multivariate EEG by introducing STAN, a cascaded Adversarial Spatio-Temporal Attention Network that jointly models spatial connectivity and temporal dynamics. The approach employs three cascaded attention blocks with alternating spatial and temporal modules and a gradient-penalized adversarial discriminator to learn robust preictal representations from clearly defined 15-minute windows, enabling reliable alarms typically 15–45 minutes before onset. Empirical evaluation on CHB-MIT and MSSM datasets demonstrates state-of-the-art sensitivity with dramatically reduced false-alarm rates (e.g., 96.6% Sn, 0.011/h FDR on CHB-MIT; 94.2% Sn, 0.063/h FDR on MSSM) while maintaining edge-friendly efficiency (2.3M parameters, 45 ms latency, 180 MB). The framework supports real-time monitoring, subject-specific adaptability without individualized training, and broad applicability to healthcare time series forecasting, offering a practical path toward precision medicine in epilepsy management and beyond.
Abstract
Forecasting epileptic seizures from multivariate EEG signals represents a critical challenge in healthcare time series prediction, requiring high sensitivity, low false alarm rates, and subject-specific adaptability. We present STAN, an Adversarial Spatio-Temporal Attention Network that jointly models spatial brain connectivity and temporal neural dynamics through cascaded attention blocks with alternating spatial and temporal modules. Unlike existing approaches that assume fixed preictal durations or separately process spatial and temporal features, STAN captures bidirectional dependencies between spatial and temporal patterns through a unified cascaded architecture. Adversarial training with gradient penalty enables robust discrimination between interictal and preictal states learned from clearly defined 15-minute preictal windows. Continuous 90-minute pre-seizure monitoring reveals that the learned spatio-temporal attention patterns enable early detection: reliable alarms trigger at subject-specific times (typically 15-45 minutes before onset), reflecting the model's capacity to capture subtle preictal dynamics without requiring individualized training. Experiments on two benchmark EEG datasets (CHB-MIT scalp: 8 subjects, 46 events; MSSM intracranial: 4 subjects, 14 events) demonstrate state-of-the-art performance: 96.6% sensitivity with 0.011 false detections per hour and 94.2% sensitivity with 0.063 false detections per hour, respectively, while maintaining computational efficiency (2.3M parameters, 45 ms latency, 180 MB memory) for real-time edge deployment. Beyond epilepsy, the proposed framework provides a general paradigm for spatio-temporal forecasting in healthcare and other time series domains where individual heterogeneity and interpretability are crucial.
