The Field Equations of Penalized non-Parametric Regression
Sven Pappert
TL;DR
The paper reframes penalized non-parametric regression as a variational problem in which the minimizer of a risk functional consisting of a fitness term and a gradient-based penalty satisfies Euler–Lagrange field equations. Focusing on the MSE fitness with a squared-$\\ell^2$ gradient penalty, the minimizer solves a second-order inhomogeneous PDE with inhomogeneity given by $\\mathbb{E}(Y|X)$ and a coefficient involving the feature distribution $f_X$, reducing to the Rudin-Osher-Fatemi model when $f_X$ is uniform. The framework unifies linear Ridge regularization as a special case and connects to smoothing splines and backward-energy penalties, with clear boundary conditions and conditions ensuring minimality. It also discusses practical implications for post-processing estimators, potential extensions to other penalties, and avenues for future research on convergence and universal approximation.
Abstract
We view penalized risks through the lens of the calculus of variations. We consider risks comprised of a fitness-term (e.g. MSE) and a gradient-based penalty. After establishing the Euler-Lagrange field equations as a systematic approach to finding minimizers of risks involving only first derivatives, we proceed to exemplify this approach to the MSE penalized by the integral over the squared l2-norm of the gradient of the regression function. The minimizer of this risk is given as the solution to a second order inhomogeneous PDE, where the inhomogeneity is given as the conditional expectation of the target variable conditioned on the features. We discuss properties of the field equations and practical implications thereof, which also apply to the classical Ridge penalty for linear models, and embed our findings into the existing literature. In particular, we find that we can recover the Rudin-Osher-Fatemi model for image-denoising, if we consider the features as deterministic and evenly distributed. Last, we outline several directions for future research.
