Latent Variable Double Gaussian Process Model for Decoding Complex Neural Data
Navid Ziaei, Joshua J. Stim, Melanie D. Goodman-Keiser, Scott Sponheim, Alik S. Widge, Sasoun Krikorian, Ali Yousefi
TL;DR
The paper tackles decoding complex neural data with limited samples by introducing a Latent Variable Double Gaussian Process (GP) that models both neural activity and stimuli as outputs of two GPs driven by a shared latent representation $\mathbf{X} \in \mathbb{R}^{N\times Q}$. It uses doubly stochastic variational inference with inducing points to compute a tractable Evidence Lower Bound (ELBO) and infer a robust latent manifold, enabling probabilistic predictions and uncertainty quantification. On the VerbMem EEG dataset, the model achieves a test discrimination accuracy of $92\%$ and an average decoding accuracy of $76.4\%$, outperforming XGBoost with a higher F-measure ($0.766$ vs $0.726$). The work demonstrates a scalable, non-parametric framework that reveals latent structure in neural data and supports accurate decoding while providing principled uncertainty estimates, with open-source tooling to foster adoption and further development.
Abstract
Non-parametric models, such as Gaussian Processes (GP), show promising results in the analysis of complex data. Their applications in neuroscience data have recently gained traction. In this research, we introduce a novel neural decoder model built upon GP models. The core idea is that two GPs generate neural data and their associated labels using a set of low-dimensional latent variables. Under this modeling assumption, the latent variables represent the underlying manifold or essential features present in the neural data. When GPs are trained, the latent variable can be inferred from neural data to decode the labels with a high accuracy. We demonstrate an application of this decoder model in a verbal memory experiment dataset and show that the decoder accuracy in predicting stimulus significantly surpasses the state-of-the-art decoder models. The preceding performance of this model highlights the importance of utilizing non-parametric models in the analysis of neuroscience data.
