DIVER-1 : Deep Integration of Vast Electrophysiological Recordings at Scale
Danny Dongyeop Han, Yonghyeon Gwon, Ahhyun Lucy Lee, Taeyang Lee, Seong Jin Lee, Jubin Choi, Sebin Lee, Jihyun Bang, Seungju Lee, David Keetae Park, Shinjae Yoo, Chun Kee Chung, Jiook Cha
TL;DR
This work tackles scaling electrophysiology foundation models (EFMs) for EEG and iEEG, domains where data are scarce and heterogeneous. It introduces DIVER-1, a family of self-supervised EFMs with architectural innovations such as any-variate attention, sliding temporal conditional positional encoding, spatio-temporal register tokens, and multi-domain reconstruction, trained on the largest electrophysiology corpora to date. The authors demonstrate data-constrained scaling laws for EFMs, showing that, at fixed compute, smaller models trained longer achieve better performance, and provide IsoLoss guidance for compute budgeting. DIVER-1 achieves state-of-the-art downstream decoding on iEEG and EEG benchmarks and exhibits robust generalization across modalities and dataset shifts, with ablations confirming the value of each architectural component. The study offers practical guidance for scaling EFMs and points to future directions in cross-subject learning and data-efficient fine-tuning to broaden applicability in neuroscience and clinical contexts.
Abstract
Electrophysiology signals such as EEG and iEEG are central to neuroscience, brain-computer interfaces, and clinical applications, yet existing foundation models remain limited in scale despite clear evidence that scaling improves performance. We introduce DIVER-1, a family of EEG and iEEG foundation models trained on the largest and most diverse corpus to date-5.3k hours of iEEG and 54k hours of EEG (1.6M channel-hours from over 17.7k subjects)-and scaled up to 1.82B parameters. We present the first systematic scaling law analysis for this domain, showing that they follow data-constrained scaling laws: for a given amount of data and compute, smaller models trained for extended epochs consistently outperform larger models trained briefly. This behavior contrasts with prior electrophysiology foundation models that emphasized model size over training duration. To achieve strong performance, we also design architectural innovations including any-variate attention, sliding temporal conditional positional encoding, and multi-domain reconstruction. DIVER-1 iEEG and EEG models each achieve state-of-the-art performance on their respective benchmarks, establishing a concrete guidelines for efficient scaling and resource allocation in electrophysiology foundation model development.
