Geodesics in the Deep Linear Network
Alan Chen
TL;DR
The paper studies geodesics in the DLN geometry by viewing the end-to-end map as a Riemannian submersion from a balanced manifold $\\mathcal{M}$ to the observable space $(\\mathfrak{M}_d, g^N)$. It derives general ODEs for geodesics on $(\\mathfrak{M}_d, g^N)$ and gives explicit closed-form DLN geodesics in the special case where the endpoints' singular vectors are related by a fixed orthogonal matrix $Q \\in O_d$, by lifting horizontal straight lines. The explicit geodesic formula is presented in terms of the SVD components and reduces to simpler diagonal forms in special limits, aligned with the Bures-Wasserstein case $N=2$ and its diagonal limit. The work lays a geometric foundation for understanding training dynamics in DLN via geodesic distances and suggests open problems for arbitrary endpoints and rank-deficient regimes.
Abstract
We derive a general system of ODEs and associated explicit solutions in a special case for geodesics between full rank matrices in the deep linear network geometry. In the process, we characterize all horizontal straight lines in the invariant balanced manifold that remain geodesics under Riemannian submersion.
