Nonlinear classification of neural manifolds with contextual information
Francesca Mignacco, Chi-Ning Chou, SueYeon Chung
TL;DR
The paper tackles nonlinear classification of neural manifolds by introducing context-dependent gating to enable nonlinear readouts, addressing the limitations of linear manifold capacity in early layers. It develops a tractable theoretical framework that models neural representations as manifolds embedded in high-dimensional space and gates decisions via 2^K context hyperplanes, yielding an exact capacity formula $\frac{1}{\alpha^*(K,\Phi,\gamma)}=\mathbb{E}_{y,\xi,R}[\max_{c}\min_{H_c} (1/P)\sum_{\mu} ||H^\mu_c-\xi^\mu_c||^2]$, capturing the interplay between manifold geometry, context correlations $\Phi$, and margin $\gamma$. The authors validate the theory on synthetic data (random points, spherical manifolds, Gaussian mixtures) and apply it to ResNet-50 features under supervised and SimCLR objectives, revealing progressive, context-driven untangling across layers and improvements when using informative context directions (e.g., principal components). This framework offers a scalable, data-driven way to quantify context-dependent computation across scales, datasets, and models, with potential applications to biological neural data and beyond.
Abstract
Understanding how neural systems efficiently process information through distributed representations is a fundamental challenge at the interface of neuroscience and machine learning. Recent approaches analyze the statistical and geometrical attributes of neural representations as population-level mechanistic descriptors of task implementation. In particular, manifold capacity has emerged as a promising framework linking population geometry to the separability of neural manifolds. However, this metric has been limited to linear readouts. To address this limitation, we introduce a theoretical framework that leverages latent directions in input space, which can be related to contextual information. We derive an exact formula for the context-dependent manifold capacity that depends on manifold geometry and context correlations, and validate it on synthetic and real data. Our framework's increased expressivity captures representation reformatting in deep networks at early stages of the layer hierarchy, previously inaccessible to analysis. As context-dependent nonlinearity is ubiquitous in neural systems, our data-driven and theoretically grounded approach promises to elucidate context-dependent computation across scales, datasets, and models.
