Fredholm integral equations for function approximation and the training of neural networks
Patrick Gelß, Aizhan Issagali, Ralf Kornhuber
TL;DR
This paper introduces a Fredholm-integral equation framework for function approximation and training of large, high-dimensional shallow neural networks, recasting the discrete, nonlinear training problem into a linear continuous problem via Ritz–Galerkin discretization and Tikhonov regularization. By interpreting networks as mean-field like kernels and employing functional tensor networks, the authors solve high-dimensional linear systems efficiently with tensor-train methods, and then sample discrete network parameters to form predictive models. The approach yields competitive results on bank note authentication, concrete strength prediction, and MNIST classification, demonstrating practical viability without bespoke feature engineering. The work opens avenues for deep Fredholm networks and further enhancements in tensor factorization, sampling strategies, and mean-field analyses, offering a scalable alternative to conventional gradient-based training in certain regimes.
Abstract
We present a novel and mathematically transparent approach to function approximation and the training of large, high-dimensional neural networks, based on the approximate least-squares solution of associated Fredholm integral equations of the first kind by Ritz-Galerkin discretization, Tikhonov regularization and tensor-train methods. Practical application to supervised learning problems of regression and classification type confirm that the resulting algorithms are competitive with state-of-the-art neural network-based methods.
