On Functional Dimension and Persistent Pseudodimension
J. Elisenda Grigsby, Kathryn Lindsey
TL;DR
This work addresses local complexity in fixed-architecture ReLU networks by introducing two local measures, $dim_{fun}(\theta)$ and $dim_{p.{VC\Delta}}(\mathcal{F},\theta)$, and establishing a fundamental inequality $dim_{fun}(\theta) \le dim_{p.{VC\Delta}}(\mathcal{F},\theta) \le \sup_{Z} r_{\mathbb{R}}(\mathbf{J}E^R_Z(\theta))$ that links parameter redundancy to the real rank of the algebraic Jacobian. It develops a batch-wise, algebraic framework involving the algebraic evaluation map $E_Z^R$, activation matrices $\alpha^R$ and $\alpha$, and a local batch-fiber structure, to derive concrete, locality-aware bounds on the persistent pseudodimension and its relation to functional dimension. The paper shows how the rank gap between polynomial-ring and real-valued Jacobians governs these bounds and discusses overparameterization as a regime where the gap may close, supported by McCoy’s theorem. A key conjecture is that for generic, overparameterized ReLU families, the local complexities coincide, providing a principled explanation for observed generalization behavior and its relation to double descent.
Abstract
For any fixed feedforward ReLU neural network architecture, it is well-known that many different parameter settings can determine the same function. It is less well-known that the degree of this redundancy is inhomogeneous across parameter space. In this work, we discuss two locally applicable complexity measures for ReLU network classes and what we know about the relationship between them: (1) the local functional dimension [14, 18], and (2) a local version of VC dimension that we call persistent pseudodimension. The former is easy to compute on finite batches of points; the latter should give local bounds on the generalization gap, which would inform an understanding of the mechanics of the double descent phenomenon [7].
