A Survey on Universal Approximation Theorems
Midhun T Augustine
TL;DR
This survey analyzes universal approximation theorems (UATs) for feedforward neural networks, tracing results from early function-approximation theory (Taylor, Fourier, Weierstrass, Kolmogorov–Arnold) to modern NN-based density results in spaces like $\mathcal{C}(\mathbb{X})$ and $L^p$. It differentiates between arbitrary width (bounded depth) and arbitrary depth (bounded width) frameworks, highlighting that nonpolynomial activations render shallow networks universal and establishing width thresholds (e.g., $W\le n+4$ and $W^* = \max\{n+1, m\}$) for universal approximation. The paper also documents historical milestones (Cun, Lapedes–Farber, Cybenko, Hornik, Leshno) and clarifies how depth contributes to expressivity beyond width limitations, with implications for architecture design. By connecting classical approximation theory to NN expressivity and outlining extensions to other architectures, the work provides a consolidated reference for researchers assessing the theoretical capabilities of neural networks in real-world settings.
Abstract
This paper discusses various theorems on the approximation capabilities of neural networks (NNs), which are known as universal approximation theorems (UATs). The paper gives a systematic overview of UATs starting from the preliminary results on function approximation, such as Taylor's theorem, Fourier's theorem, Weierstrass approximation theorem, Kolmogorov - Arnold representation theorem, etc. Theoretical and numerical aspects of UATs are covered from both arbitrary width and depth.
