Learning Time-Scale Invariant Population-Level Neural Representations
Eshani Patel, Yisong Yue, Geeling Chau
TL;DR
The paper addresses the challenge of generalizing population-level neural representations across varying input time-scales in iEEG data. It introduces Time-scale Augmented Pretraining (TSAP) on top of the Population Transformer (PopT), using a BrainBERT temporal encoder, and exposes the model to multiple interval lengths during pretraining. Across Word Onset and Sentence Onset tasks on BrainTreeBank data, TSAP closes the performance gap caused by time-scale mismatches and often exceeds optimally matched baselines, even for unseen scales such as held-out 3-second intervals. Embedding analyses show that TSAP reduces time-scale clustering in the representation space, supporting more invariant, transferable population-level neural representations for neuroscience and brain–computer interface applications.
Abstract
General-purpose foundation models for neural time series can help accelerate neuroscientific discoveries and enable applications such as brain computer interfaces (BCIs). A key component in scaling these models is population-level representation learning, which leverages information across channels to capture spatial as well as temporal structure. Population-level approaches have recently shown that such representations can be both efficient to learn on top of pretrained temporal encoders and produce useful representations for decoding a variety of downstream tasks. However, these models remain sensitive to mismatches in preprocessing, particularly on time-scales, between pretraining and downstream settings. We systematically examine how time-scale mismatches affects generalization and find that existing representations lack invariance. To address this, we introduce Time-scale Augmented Pretraining (TSAP), which consistently improves robustness to different time-scales across decoding tasks and builds invariance in the representation space. These results highlight handling preprocessing diversity as a key step toward building generalizable neural foundation models.
