Two-hidden-layer ReLU neural networks and finite elements
Pengzhan Jin
TL;DR
This work addresses bridging ReLU neural networks and finite element function spaces by introducing a weak representation on convex polytope meshes, showing that two-hidden-layer networks can approximate piecewise linear FE functions with precisely computable neuron counts. It formalizes a constructive weak representation theorem for constant and linear FE, and demonstrates L^p approximation density by relating NN representations to FE spaces. In addition, it establishes a strict representation framework for tensor finite element functions using tensor neural networks, with explicit formulas and a concrete 2D example. Overall, the paper provides a computable, theory-grounded bridge between NN approximation and FE analysis, enabling rigorous L^p estimates and offering a path to efficient high-dimensional representations via tensor NNs.
Abstract
We point out that (continuous or discontinuous) piecewise linear functions on a convex polytope mesh can be represented by two-hidden-layer ReLU neural networks in a weak sense. In addition, the numbers of neurons of the two hidden layers required to weakly represent are accurately given based on the numbers of polytopes and hyperplanes involved in this mesh. The results naturally hold for constant and linear finite element functions. Such weak representation establishes a bridge between two-hidden-layer ReLU neural networks and finite element functions, and leads to a perspective for analyzing approximation capability of ReLU neural networks in $L^p$ norm via finite element functions. Moreover, we discuss the strict representation for tensor finite element functions via the recent tensor neural networks.
