Towards an Algebraic Framework For Approximating Functions Using Neural Network Polynomials
Shakil Rafi, Joshua Lee Padgett, Ukash Nakarmi
TL;DR
The paper develops an algebraic framework for neural network objects, treating feedforward nets as mathematical entities that can mimic real-valued operations. It constructs a hierarchy of neural networks, including $\mathsf{Pwr}_n^{q,\varepsilon}$, $\mathsf{Pnm}^{q,\varepsilon}_{n,C}$, $\mathsf{Xpn}^{q,\varepsilon}_n$, $\mathsf{Csn}^{q,\varepsilon}_n$, and $\mathsf{Sne}^{q,\varepsilon}_n$, and combines them with trapezoidal integration and interpolation tools to realize $1$-D function approximation and $\int_a^b e^x\,dx$ via $\mathsf{E}^{N,h,q,\varepsilon}_n$. The work provides explicit depth, width, and parameter bounds, demonstrating polynomial growth in network size relative to target accuracy measured in the $1$-norm, and paves a path for algebraic manipulation of neural networks as approximants to classical functions. It also introduces tunneling networks and network-diagrambased representations to manage unequal depths and sums, enabling scalable construction of neural-network-based approximants for polynomials, exponentials, and trigonometric functions. The framework aims to support applications in solving PDEs and related problems by providing tractable, algebraic neural approximants with provable error controls.
Abstract
We make the case for neural network objects and extend an already existing neural network calculus explained in detail in Chapter 2 on \cite{bigbook}. Our aim will be to show that, yes, indeed, it makes sense to talk about neural network polynomials, neural network exponentials, sine, and cosines in the sense that they do indeed approximate their real number counterparts subject to limitations on certain of their parameters, $q$, and $\varepsilon$. While doing this, we show that the parameter and depth growth are only polynomial on their desired accuracy (defined as a 1-norm difference over $\mathbb{R}$), thereby showing that this approach to approximating, where a neural network in some sense has the structural properties of the function it is approximating is not entire intractable.
