Expressivity of Neural Networks with Random Weights and Learned Biases
Ezekiel Williams, Alexandre Payeur, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Matthew G. Perich, Luca Mazzucato, Guillaume Lajoie
TL;DR
This work shows that neural networks can express a wide class of functions and dynamical trajectories even when all weights are fixed randomly and only biases are learned. By introducing γ-bias-learning activations and leveraging masking-like arguments, the authors prove universal approximation for bias-learning feedforward networks and finite-horizon trajectory approximation for bias-learning recurrent networks on compact sets. They complement the theory with extensive simulations in multi-task learning, dynamical system forecasting, and motor control, illustrating task-specific organization and comparing bias learning to masking approaches. The results have implications for neuroscience, where bias modulation can reconfigure dynamics without synaptic changes, and AI, where efficient fine-tuning via biases or prefixes may achieve broad adaptability. Overall, the paper links bias-centric learning to universal expressivity and provides both theoretical and empirical grounding for bias-driven adaptation in neural systems.
Abstract
Landmark universal function approximation results for neural networks with trained weights and biases provided the impetus for the ubiquitous use of neural networks as learning models in neuroscience and Artificial Intelligence (AI). Recent work has extended these results to networks in which a smaller subset of weights (e.g., output weights) are tuned, leaving other parameters random. However, it remains an open question whether universal approximation holds when only biases are learned, despite evidence from neuroscience and AI that biases significantly shape neural responses. The current paper answers this question. We provide theoretical and numerical evidence demonstrating that feedforward neural networks with fixed random weights can approximate any continuous function on compact sets. We further show an analogous result for the approximation of dynamical systems with recurrent neural networks. Our findings are relevant to neuroscience, where they demonstrate the potential for behaviourally relevant changes in dynamics without modifying synaptic weights, as well as for AI, where they shed light on recent fine-tuning methods for large language models, like bias and prefix-based approaches.
