Population Transformer: Learning Population-level Representations of Neural Activity
Geeling Chau, Christopher Wang, Sabera Talukder, Vighnesh Subramaniam, Saraswati Soedarmadji, Yisong Yue, Boris Katz, Andrei Barbu
TL;DR
This paper tackles learning population-level representations from neural time-series data with highly variable electrode configurations. It introduces Population Transformer (PopT), a modular transformer-based spatial aggregator that sits on top of frozen temporal embeddings and is pretrained with two discriminative self-supervised objectives. Pretraining yields subject-generic, spatial-contextual channel representations that improve downstream decoding across iEEG and EEG tasks while reducing data and compute requirements, and generalizes to unseen subjects. The authors also provide interpretability tools to map connectivity and candidate functional brain regions from PopT weights, and release pretrained models and code for off-the-shelf use in multi-channel neural decoding.
Abstract
We present a self-supervised framework that learns population-level codes for arbitrary ensembles of neural recordings at scale. We address key challenges in scaling models with neural time-series data, namely, sparse and variable electrode distribution across subjects and datasets. The Population Transformer (PopT) stacks on top of pretrained temporal embeddings and enhances downstream decoding by enabling learned aggregation of multiple spatially-sparse data channels. The pretrained PopT lowers the amount of data required for downstream decoding experiments, while increasing accuracy, even on held-out subjects and tasks. Compared to end-to-end methods, this approach is computationally lightweight, while achieving similar or better decoding performance. We further show how our framework is generalizable to multiple time-series embeddings and neural data modalities. Beyond decoding, we interpret the pretrained and fine-tuned PopT models to show how they can be used to extract neuroscience insights from large amounts of data. We release our code as well as a pretrained PopT to enable off-the-shelf improvements in multi-channel intracranial data decoding and interpretability. Code is available at https://github.com/czlwang/PopulationTransformer.
