STAMP: Spatial-Temporal Adapter with Multi-Head Pooling
Brad Shook, Abby Turner, Jieshi Chen, Michał Wiliński, Mononito Goswami, Jonathan Elmer, Artur Dubrawski
TL;DR
STAMP is a lightweight Spatial-Temporal Adapter that sits on frozen TSFMs to tackle EEG classification tasks by explicitly encoding spatial and temporal structure. It combines positional encodings, a criss-cross gated MMLP, and multi-head attention pooling to convert univariate TSFM embeddings into strong discriminative features, achieving performance on eight EEG benchmarks that rivals EEG foundation models while using only about $7.5\times 10^5$ trainable parameters. Ablation studies show that positional encoding and token mixing are essential, and the approach proves versatile across different TSFMs, underlining the practicality of reusing large pre-trained time-series models for EEG without extensive fine-tuning. These findings highlight STAMP's potential to extend to other multivariate time-series domains, offering a scalable, interpretable path to leveraging general-purpose foundation models in specialized biomedical tasks.
Abstract
Time series foundation models (TSFMs) pretrained on data from multiple domains have shown strong performance on diverse modeling tasks. Various efforts have been made to develop foundation models specific to electroencephalography (EEG) data, which records brain electrical activity as time series. However, no comparative analysis of EEG-specific foundation models (EEGFMs) versus general TSFMs has been performed on EEG-specific tasks. We introduce a novel Spatial-Temporal Adapter with Multi-Head Pooling (STAMP), which leverages univariate embeddings produced by a general TSFM, implicitly models spatial-temporal characteristics of EEG data, and achieves performance comparable to state-of-the-art EEGFMs. A comprehensive analysis is performed on 8 benchmark datasets of clinical tasks using EEG for classification, along with ablation studies. Our proposed adapter is lightweight in trainable parameters and flexible in the inputs it can accommodate, supporting easy modeling of EEG data using TSFMs.
