Deep Learning without Global Optimization by Random Fourier Neural Networks
Owen Davis, Gianluca Geraci, Mohammad Motamed
TL;DR
This paper proposes a global-optimization-free training algorithm for deep residual networks with random Fourier (complex exponential) activations, termed random Fourier neural networks (rFNNs). It introduces a block-by-block training scheme where each block learns a residual correction using optimally derived frequency distributions and convex amplitude fitting, aided by adaptive Metropolis within Gibbs sampling to update frequencies. The approach achieves or surpasses the known theoretical approximation rates for rFNNs, learns high-frequency and multiscale features efficiently, and provides interpretable frequency decompositions, while avoiding Gibbs phenomena in discontinuous targets. Empirical results on discontinuous and multidimensional functions illustrate faster convergence and superior approximation relative to global optimization baselines, with potential extensions to uncertainty quantification and vector-valued tasks.
Abstract
We introduce a new training algorithm for deep neural networks that utilize random complex exponential activation functions. Our approach employs a Markov Chain Monte Carlo sampling procedure to iteratively train network layers, avoiding global and gradient-based optimization while maintaining error control. It consistently attains the theoretical approximation rate for residual networks with complex exponential activation functions, determined by network complexity. Additionally, it enables efficient learning of multiscale and high-frequency features, producing interpretable parameter distributions. Despite using sinusoidal basis functions, we do not observe Gibbs phenomena in approximating discontinuous target functions.
