Diffusion-Based Generation of Neural Activity from Disentangled Latent Codes
Jonathan D. McCart, Andrew R. Sedler, Christopher Versteeg, Domenick Mifsud, Mattia Rigotti-Thompson, Chethan Pandarinath
TL;DR
GNOCCHI introduces a diffusion-based, conditional generative model that learns disentangled latent codes $\mathbf{c} \in \mathbb{R}^L$ from time-series neural activity using an auxiliary encoder and a denoising network conditioned on $\mathbf{c}$; the forward process is $\tilde{\mathbf{x}}_i = \sqrt{\bar{\alpha}_i}\mathbf{x}_0 + \sqrt{1-\bar{\alpha}_i}\epsilon$ with $\epsilon \sim \mathcal{N}(0, I)$, and training uses score, reconstruction, and MMD losses to promote high-information conditioning. GNOCCHI is compared to LFADS on both synthetic motor-task data and real monkey M1 recordings, showing more structured and disentangled latent spaces and enabling accurate generation of samples for unseen behavioral conditions via latent navigation. The results demonstrate that GNOCCHI can linearly traverse latent axes to produce controlled changes in behavior while preserving other variables, outperforming LFADS in disentanglement and held-out generalization. This approach advances unsupervised discovery of interpretable neural representations and holds promise for data augmentation and brain-machine interface applications, with future work extending to jointly generating neural activity and behavior and to continuous-time, unconstrained data.
Abstract
Recent advances in recording technology have allowed neuroscientists to monitor activity from thousands of neurons simultaneously. Latent variable models are increasingly valuable for distilling these recordings into compact and interpretable representations. Here we propose a new approach to neural data analysis that leverages advances in conditional generative modeling to enable the unsupervised inference of disentangled behavioral variables from recorded neural activity. Our approach builds on InfoDiffusion, which augments diffusion models with a set of latent variables that capture important factors of variation in the data. We apply our model, called Generating Neural Observations Conditioned on Codes with High Information (GNOCCHI), to time series neural data and test its application to synthetic and biological recordings of neural activity during reaching. In comparison to a VAE-based sequential autoencoder, GNOCCHI learns higher-quality latent spaces that are more clearly structured and more disentangled with respect to key behavioral variables. These properties enable accurate generation of novel samples (unseen behavioral conditions) through simple linear traversal of the latent spaces produced by GNOCCHI. Our work demonstrates the potential of unsupervised, information-based models for the discovery of interpretable latent spaces from neural data, enabling researchers to generate high-quality samples from unseen conditions.
