The Representational Geometry of Number
Zhimin Hu, Lanhao Niu, Sashank Varma
TL;DR
The paper investigates whether conceptual representations share a common geometric framework or decompose into task-specific subspaces. Using numbers as a testbed and large language models as high‑dimensional substrates, it reveals a stable, shared relational structure across tasks that is preserved through linear transformations, while task-specific representations occupy largely separable subspaces. Low Procrustes disparity ($M^2 \approx 0.010$) and high SVCCA compatibility ($\approx 0.80$–$0.90$) indicate that, despite subspace separation, the underlying relations among numbers are coherent and transferable. These findings provide a mechanistic, geometry‑driven account of how representations balance shared structure with task flexibility, with implications for conceptual spaces theory and cognitive science, including insights into why certain numerical abilities may falter in dyscalculia.
Abstract
A central question in cognitive science is whether conceptual representations converge onto a shared manifold to support generalization, or diverge into orthogonal subspaces to minimize task interference. While prior work has discovered evidence for both, a mechanistic account of how these properties coexist and transform across tasks remains elusive. We propose that representational sharing lies not in the concepts themselves, but in the geometric relations between them. Using number concepts as a testbed and language models as high-dimensional computational substrates, we show that number representations preserve a stable relational structure across tasks. Task-specific representations are embedded in distinct subspaces, with low-level features like magnitude and parity encoded along separable linear directions. Crucially, we find that these subspaces are largely transformable into one another via linear mappings, indicating that representations share relational structure despite being located in distinct subspaces. Together, these results provide a mechanistic lens of how language models balance the shared structure of number representation with functional flexibility. It suggests that understanding arises when task-specific transformations are applied to a shared underlying relational structure of conceptual representations.
