Neural population geometry and optimal coding of tasks with shared latent structure
Albert J. Wakhloo, Will Slatton, SueYeon Chung
TL;DR
The paper introduces a Gaussian-equivalence, geometry-driven theory that links population-level neural activity statistics to the generalization performance of linear readouts across tasks tied to a shared latent structure. It identifies four key geometric terms—$c$ (neural-latent correlation), $f$ (signal-signal factorization), $s$ (signal-noise factorization), and $PR(\Psi)$ (neural dimensionality)—that fully determine cross-task performance and predicts that disentangled representations are optimal. The authors demonstrate how optimal codes compress less informative latent variables when data are scarce and expand them as data become abundant, with the eigen-spectrum becoming flatter as experience grows. They validate the theory on synthetic MLPs and macaque V4/IT data, showing accurate predictions of readout generalization and revealing distinct geometric signatures along the ventral stream. The work provides a principled link between neural population geometry and multi-task learning, offering testable predictions for neural coding and learning dynamics across both artificial and biological systems.
Abstract
Humans and animals can recognize latent structures in their environment and apply this information to efficiently navigate the world. However, it remains unclear what aspects of neural activity contribute to these computational capabilities. Here, we develop an analytical theory linking the geometry of a neural population's activity to the generalization performance of a linear readout on a set of tasks that depend on a common latent structure. We show that four geometric measures of the activity determine performance across tasks. Using this theory, we find that experimentally observed disentangled representations naturally emerge as an optimal solution to the multi-task learning problem. When data is scarce, these optimal neural codes compress less informative latent variables, and when data is abundant, they expand these variables in the state space. We validate our theory using macaque ventral stream recordings. Our results therefore tie population geometry to multi-task learning.
