RNNs perform task computations by dynamically warping neural representations
Arthur Pellegrino, Angus Chadwick
TL;DR
The paper introduces a Riemannian-geometric framework that links the topology of low-dimensional input manifolds to the geometry of the dynamical neural state manifolds via a pullback metric derived from adjoint dynamics. It demonstrates that recurrent networks solve time-dependent tasks by dynamically warping their internal representations, selectively compressing irrelevant inputs while aligning relevant variables with readouts. Across static and sequential tasks, the authors show that computations manifest as context-dependent warping of manifolds (e.g., circles to warped grids, 3D manifolds to line attractors, and hyper-tori for memory). This approach provides a principled, mathematically tractable way to interpret how geometry encodes computation in RNNs and offers a foundation for analyzing data-driven dynamical systems.
Abstract
Analysing how neural networks represent data features in their activations can help interpret how they perform tasks. Hence, a long line of work has focused on mathematically characterising the geometry of such "neural representations." In parallel, machine learning has seen a surge of interest in understanding how dynamical systems perform computations on time-varying input data. Yet, the link between computation-through-dynamics and representational geometry remains poorly understood. Here, we hypothesise that recurrent neural networks (RNNs) perform computations by dynamically warping their representations of task variables. To test this hypothesis, we develop a Riemannian geometric framework that enables the derivation of the manifold topology and geometry of a dynamical system from the manifold of its inputs. By characterising the time-varying geometry of RNNs, we show that dynamic warping is a fundamental feature of their computations.
