Harnessing uncertainty when learning through Equilibrium Propagation in neural networks
Jonathan Peters, Philippe Talatchian
TL;DR
The paper investigates training deep networks with Equilibrium Propagation (EP) on hardware subject to measurement uncertainty. It introduces a stochastic EP framework for nonlinear resistive networks, modeling post-activation noise with $V^{\text{samp}} = V^{\text{att}} + \sigma\,dB_t$, and shows a dataset-independent critical limit near $\sigma_c \approx 5\times 10^{-5}$; crucially, sampling per attractor state raises this limit according to $\sigma^{\text{act}} = \sigma/\sqrt{N}$ via the Central Limit Theorem, enabling reliable learning on noisier hardware. Empirical results on MNIST, KMNIST, and FashionMNIST reveal that optimal noise levels improve convergence and testing accuracy (e.g., KMNIST from ~77% to ~97%, FashionMNIST from ~26% to ~93%), while MNIST remains reliably learnable even without noise. These findings offer a concrete path toward energy-efficient, self-learning hardware that leverages EP under realistic uncertainties.
Abstract
Equilibrium Propagation (EP) is a supervised learning algorithm that trains network parameters using local neuronal activity. This is in stark contrast to backpropagation, where updating the parameters of the network requires significant data shuffling. Avoiding data movement makes EP particularly compelling as a learning framework for energy-efficient training on neuromorphic systems. In this work, we assess the ability of EP to learn on hardware that contain physical uncertainties. This is particularly important for researchers concerned with hardware implementations of self-learning systems that utilize EP. Our results demonstrate that deep, multi-layer neural network architectures can be trained successfully using EP in the presence of finite uncertainties, up to a critical limit. This limit is independent of the training dataset, and can be scaled through sampling the network according to the central limit theorem. Additionally, we demonstrate improved model convergence and performance for finite levels of uncertainty on the MNIST, KMNIST and FashionMNIST datasets. Optimal performance is found for networks trained with uncertainties close to the critical limit. Our research supports future work to build self-learning hardware in situ with EP.
