How does training shape the Riemannian geometry of neural network representations?
Jacob A. Zavatone-Veth, Sheng Yang, Julian A. Rubinfien, Cengiz Pehlevan
TL;DR
The paper investigates how training reshapes the Riemannian geometry of neural representations by studying the metric induced on input space by neural feature maps. It establishes an infinite-width baseline where shallow networks produce spherically symmetric metrics, and empirically shows that training magnifies volume elements near decision boundaries across shallow, deep, and self-supervised settings. The results suggest that feature learning exploits nonlinear geometry to enhance discriminability near boundaries, offering a framework to understand and design geometric inductive biases. This geometry-centric perspective provides groundwork for principled analysis of generalization, robustness, and kernel-learning methods in neural representations.
Abstract
In machine learning, there is a long history of trying to build neural networks that can learn from fewer example data by baking in strong geometric priors. However, it is not always clear a priori what geometric constraints are appropriate for a given task. Here, we explore the possibility that one can uncover useful geometric inductive biases by studying how training molds the Riemannian geometry induced by unconstrained neural network feature maps. We first show that at infinite width, neural networks with random parameters induce highly symmetric metrics on input space. This symmetry is broken by feature learning: networks trained to perform classification tasks learn to magnify local areas along decision boundaries. This holds in deep networks trained on high-dimensional image classification tasks, and even in self-supervised representation learning. These results begin to elucidate how training shapes the geometry induced by unconstrained neural network feature maps, laying the groundwork for an understanding of this richly nonlinear form of feature learning.
