Neural Networks Trained by Weight Permutation are Universal Approximators
Yongqiang Cai, Gaohang Chen, Zhonghua Qiao
TL;DR
This work addresses whether neural networks trained by weight permutation can universally approximate continuous functions. It introduces a constructive proof for the universal approximation property of permutation-trained ReLU networks on one-dimensional targets, relying on a four-pair step-function approximator and a mechanism to annihilate unused parameters, with extensions to fixed linear output and to random initializations. The key contributions are (i) a rigorous UAP guarantee for both equidistant and random initializations, (ii) a detailed step/constant/linear approximator construction, (iii) analysis of approximation rates and the impact of initialization, and (iv) empirical demonstrations across 1D and modest higher-dimensional tasks that validate the theory and reveal permutation-driven learning patterns. The results have implications for hardware-friendly fixed-weight designs and offer new insights into network learning dynamics via permutation activity, suggesting practical and theoretical avenues for permutation-based training and analysis.
Abstract
The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a novel permutation-based training method, which exhibited a desired classification performance without modifying the exact weight values. In this paper, we provide a theoretical guarantee of this permutation training method by proving its ability to guide a ReLU network to approximate one-dimensional continuous functions. Our numerical results further validate this method's efficiency in regression tasks with various initializations. The notable observations during weight permutation suggest that permutation training can provide an innovative tool for describing network learning behavior.
