The Curvature Rate λ: A Scalar Measure of Input-Space Sharpness in Neural Networks
Jacob Poschl
TL;DR
The paper introduces λ, a scalar curvature measure defined in input space as the exponential growth rate of higher-order input derivatives, estimated from low-order derivatives during training. By showing λ equals −log R for analytic functions and log Ω for bandlimited signals, the work unifies classical notions of smoothness and spectral content and extends this perspective to neural networks where λ tracks decision-boundary complexity. A Curvature Rate Regularization (CRR) is proposed to control λ directly, yielding flatter input-space geometry and improved calibration with minimal accuracy loss, and is competitive with Sharpness-Aware Minimization (SAM). This functional, parameterization-invariant framing offers a principled tool for characterizing and shaping neural representations, with implications for generalization, calibration, and robustness across tasks and architectures.
Abstract
Curvature influences generalization, robustness, and how reliably neural networks respond to small input perturbations. Existing sharpness metrics are typically defined in parameter space (e.g., Hessian eigenvalues) and can be expensive, sensitive to reparameterization, and difficult to interpret in functional terms. We introduce a scalar curvature measure defined directly in input space: the curvature rate λ, given by the exponential growth rate of higher-order input derivatives. Empirically, λ is estimated as the slope of log ||D^n f|| versus n for small n. This growth-rate perspective unifies classical analytic quantities: for analytic functions, λ corresponds to the inverse radius of convergence, and for bandlimited signals, it reflects the spectral cutoff. The same principle extends to neural networks, where λ tracks the emergence of high-frequency structure in the decision boundary. Experiments on analytic functions and neural networks (Two Moons and MNIST) show that λ evolves predictably during training and can be directly shaped using a simple derivative-based regularizer, Curvature Rate Regularization (CRR). Compared to Sharpness-Aware Minimization (SAM), CRR achieves similar accuracy while yielding flatter input-space geometry and improved confidence calibration. By grounding curvature in differentiation dynamics, λ provides a compact, interpretable, and parameterization-invariant descriptor of functional smoothness in learned models.
