Statistical Learning Analysis of Physics-Informed Neural Networks
David A. Barajas-Solano
TL;DR
This work reframes physics-informed neural networks (PINNs) for IBVPs as a singular statistical learning problem by enforcing hard initial and boundary constraints and interpreting the physics residuals as an infinite data source. It shows that training aims to minimize a KL divergence between the PINN residual distribution $p(y\mid x, t, w) q(x, t)$ and the true zero-residual distribution $\delta(0) q(x, t)$, rather than merely regularizing the model. The Local Learning Coefficient (LLC) is introduced and numerically estimated via MCMC to characterize the flatness of PINN loss minima, with experiments on a heat equation IBVP yielding $\hat{\lambda}(w^\star) \approx 9.5$ despite a large parameter count ($d=20{,}601$), indicating very flat minima. The analysis has implications for uncertainty quantification and extrapolation in PINNs, suggesting a function-space view of Bayesian UQ and highlighting that residual data in PINNs acts as indirect data rather than a traditional regularizer.
Abstract
We study the training and performance of physics-informed learning for initial and boundary value problems (IBVP) with physics-informed neural networks (PINNs) from a statistical learning perspective. Specifically, we restrict ourselves to parameterizations with hard initial and boundary condition constraints and reformulate the problem of estimating PINN parameters as a statistical learning problem. From this perspective, the physics penalty on the IBVP residuals can be better understood not as a regularizing term bus as an infinite source of indirect data, and the learning process as fitting the PINN distribution of residuals $p(y \mid x, t, w) q(x, t) $ to the true data-generating distribution $δ(0) q(x, t)$ by minimizing the Kullback-Leibler divergence between the true and PINN distributions. Furthermore, this analysis show that physics-informed learning with PINNs is a singular learning problem, and we employ singular learning theory tools, namely the so-called Local Learning Coefficient (Lau et al., 2025) to analyze the estimates of PINN parameters obtained via stochastic optimization for a heat equation IBVP. Finally, we discuss implications of this analysis on the quantification of predictive uncertainty of PINNs and the extrapolation capacity of PINNs.
