On the implicit regularization of Langevin dynamics with projected noise
Govind Menon, Austin J. Stromme, Adrien Vacher
TL;DR
We study Langevin dynamics with diffusion projected onto directions orthogonal to a compact isometric group action $G\subset O(d)$ to model symmetry-induced over-parameterization. The main result proves that, starting from a $G$-invariant distribution, the projected-noise SDE is equivalent in law to an isotropic Langevin SDE with an additional drift $- (\alpha^2-\beta^2)\nabla \log \mathrm{vol}\,\mathcal{O}_{Y_t}$, i.e. a drift by the mean curvature $H(x)=-\nabla \log \mathrm{vol}\,\mathcal{O}_x$. This drift biases trajectories toward orbits of smaller embedded volume, revealing a geometry-driven implicit regularization tied to the group action. The authors construct a coupling via a process on $G$ to relate the two dynamics, analyze concrete group actions (radial $SO(d)$, eigenvalue-conjugation, and Bures–Wasserstein-type actions), and provide a PDE-based alternative proof, illustrating how model symmetries induce bias through differential volume/curvature terms with potential implications for architecture-aware learning dynamics.
Abstract
We study Langevin dynamics with noise projected onto the directions orthogonal to an isometric group action. This mathematical model is introduced to shed new light on the effects of symmetry on stochastic gradient descent for over-parametrized models. Our main result identifies a novel form of implicit regularization: when the initial and target density are both invariant under the group action, Langevin dynamics with projected noise is equivalent in law to Langevin dynamics with isotropic diffusion but with an additional drift term proportional to the negative log volume of the group orbit. We prove this result by constructing a coupling of the two processes via a third process on the group itself, and identify the additional drift as the mean curvature of the orbits.
