A Fast Algorithm to Simulate Nonlinear Resistive Networks
Benjamin Scellier
TL;DR
This work reframes nonlinear resistive networks as a convex quadratic program over a linear-feasible set, enabling exact, fast steady-state computation via coordinate descent. It then specializes the solver to Deep Resistive Networks (DRNs), exploiting bipartite layer structure to perform exact block coordinate descent and achieve dramatic speedups over SPICE while scaling networks to MNIST-sized tasks. Empirical results show DRNs trained with equilibrium propagation reach 1.33% test error on MNIST with networks up to 327× larger and 160× faster per epoch, enabling efficient large-scale analog learning simulations. The approach promises scalable, energy-efficient hardware-inspired ML research, with clear paths to handle nonideal device behavior and to integrate with alternative learning paradigms.
Abstract
Analog electrical networks have long been investigated as energy-efficient computing platforms for machine learning, leveraging analog physics during inference. More recently, resistor networks have sparked particular interest due to their ability to learn using local rules (such as equilibrium propagation), enabling potentially important energy efficiency gains for training as well. Despite their potential advantage, the simulations of these resistor networks has been a significant bottleneck to assess their scalability, with current methods either being limited to linear networks or relying on realistic, yet slow circuit simulators like SPICE. Assuming ideal circuit elements, we introduce a novel approach for the simulation of nonlinear resistive networks, which we frame as a quadratic programming problem with linear inequality constraints, and which we solve using a fast, exact coordinate descent algorithm. Our simulation methodology significantly outperforms existing SPICE-based simulations, enabling the training of networks up to 327 times larger at speeds 160 times faster, resulting in a 50,000-fold improvement in the ratio of network size to epoch duration. Our approach can foster more rapid progress in the simulations of nonlinear analog electrical networks.
