Function and derivative approximation by shallow neural networks
Yuanyuan Li, Shuai Lu
TL;DR
This work analyzes function and derivative approximation for unknown targets on the unit cube $\Omega=(0,1)^d$ using shallow neural networks under a Tikhonov regularization framework. It introduces three network norms—the (extended) Barron norm, the variation norm, and the Radon-BV seminorm—and derives their interrelations and connections to Sobolev spaces, highlighting how the dimension $d$ affects embedding constants and error bounds. The authors establish approximation properties for each norm and provide rigorous $H^m(\Omega)$ error bounds for the regularized estimators, showing dimension-dependent rates that govern the accuracy of both function and derivative recovery. The results offer a unified view of regularization strategies for shallow nets and shed light on the dimensionality challenges, suggesting pathways to extend these insights to deep networks and broader inverse problems.
Abstract
We investigate a Tikhonov regularization scheme specifically tailored for shallow neural networks within the context of solving a classic inverse problem: approximating an unknown function and its derivatives within a unit cubic domain based on noisy measurements. The proposed Tikhonov regularization scheme incorporates a penalty term that takes three distinct yet intricately related network (semi)norms: the extended Barron norm, the variation norm, and the Radon-BV seminorm. These choices of the penalty term are contingent upon the specific architecture of the neural network being utilized. We establish the connection between various network norms and particularly trace the dependence of the dimensionality index, aiming to deepen our understanding of how these norms interplay with each other. We revisit the universality of function approximation through various norms, establish rigorous error-bound analysis for the Tikhonov regularization scheme, and explicitly elucidate the dependency of the dimensionality index, providing a clearer understanding of how the dimensionality affects the approximation performance and how one designs a neural network with diverse approximating tasks.
