Towards a "universal translator" for neural dynamics at single-cell, single-spike resolution
Yizi Zhang, Yanchen Wang, Donato Jimenez-Beneto, Zixuan Wang, Mehdi Azabou, Blake Richards, Olivier Winter, International Brain Laboratory, Eva Dyer, Liam Paninski, Cole Hurwitz
TL;DR
This work addresses the challenge of building a foundation model that can read neural activity across brain regions at single-cell, single-spike resolution. It introduces Multi-task-masking (MtM), a self-supervised training framework that alternates among masking schemes (causal, neuron, intra-region, inter-region) and uses a learnable prompt for mode switching at inference, enabling cross-scale representation learning. Evaluated on the International Brain Laboratory Neuropixels dataset, MtM consistently outperforms strong baselines (temporal masking) and shows strong multi-task generalization, especially when pretraining spans many animals and sessions; behavior decoding improves notably when conditioning on region-specific structure, with scaling benefits evident up to 34 pretraining sessions. The results suggest MtM as a viable path toward a universal translator of brain dynamics, capable of leveraging diverse, multi-region neural data to generalize to unseen animals and tasks, ultimately enabling broader, brain-wide analyses at fine temporal and cellular resolution.
Abstract
Neuroscience research has made immense progress over the last decade, but our understanding of the brain remains fragmented and piecemeal: the dream of probing an arbitrary brain region and automatically reading out the information encoded in its neural activity remains out of reach. In this work, we build towards a first foundation model for neural spiking data that can solve a diverse set of tasks across multiple brain areas. We introduce a novel self-supervised modeling approach for population activity in which the model alternates between masking out and reconstructing neural activity across different time steps, neurons, and brain regions. To evaluate our approach, we design unsupervised and supervised prediction tasks using the International Brain Laboratory repeated site dataset, which is comprised of Neuropixels recordings targeting the same brain locations across 48 animals and experimental sessions. The prediction tasks include single-neuron and region-level activity prediction, forward prediction, and behavior decoding. We demonstrate that our multi-task-masking (MtM) approach significantly improves the performance of current state-of-the-art population models and enables multi-task learning. We also show that by training on multiple animals, we can improve the generalization ability of the model to unseen animals, paving the way for a foundation model of the brain at single-cell, single-spike resolution.
