Ornstein-Uhlenbeck Adaptation as a Mechanism for Learning in Brains and Machines
Jesus Garcia Fernandez, Nasir Ahmad, Marcel van Gerven
TL;DR
Biological learning and neuromorphic hardware face challenges implementing exact gradient descent due to the need for precise gradients and non-local information. The authors propose Ornstein-Uhlenbeck adaptation (OUA), a gradient-free framework where parameter dynamics are governed by an OU process toward a mean that is updated by a global reward prediction error, enabling online exploration and exploitation in evolving environments. They validate OUA across supervised, reinforcement, recurrent, weather forecasting, and meta-learning tasks, showing robust learning and improved performance relative to baselines. The findings suggest OUA as a practical, hardware-friendly alternative to backpropagation with potential insights into noise-driven learning in the brain and scalable neuromorphic implementations.
Abstract
Learning is a fundamental property of intelligent systems, observed across biological organisms and engineered systems. While modern intelligent systems typically rely on gradient descent for learning, the need for exact gradients and complex information flow makes its implementation in biological and neuromorphic systems challenging. This has motivated the exploration of alternative learning mechanisms that can operate locally and do not rely on exact gradients. In this work, we introduce a novel approach that leverages noise in the parameters of the system and global reinforcement signals. Using an Ornstein-Uhlenbeck process with adaptive dynamics, our method balances exploration and exploitation during learning, driven by deviations from error predictions, akin to reward prediction error. Operating in continuous time, Orstein-Uhlenbeck adaptation (OUA) is proposed as a general mechanism for learning dynamic, time-evolving environments. We validate our approach across diverse tasks, including supervised learning and reinforcement learning in feedforward and recurrent systems. Additionally, we demonstrate that it can perform meta-learning, adjusting hyper-parameters autonomously. Our results indicate that OUA provides a viable alternative to traditional gradient-based methods, with potential applications in neuromorphic computing. It also hints at a possible mechanism for noise-driven learning in the brain, where stochastic neurotransmitter release may guide synaptic adjustments.
