Dense Associative Memories with Analog Circuits
Marc Gong Bacvanski, Xincheng You, John Hopfield, Dmitry Krotov
TL;DR
The paper tackles the energy and latency bottlenecks of digital AI inference by proposing Dense Associative Memories (DenseAM) as an energy-based, continuous-time computation framework implemented on analog hardware. It introduces a full hardware design using RC circuits and resistive crossbars to realize DenseAM dynamics, achieving constant-time inference largely independent of model size. Through experiments on XOR, Hamming(7,4), and a parity/energy-transformer-inspired autoregressive task, it analyzes how inference time, energy, and hardware area scale, showing favorable linear energy scaling and practical latency bounds within CMOS technology. The work suggests a compelling co-design path for future AI accelerators where stable attractor dynamics and global energy minimization underpin fast, scalable inference across memory-centric and transformer-like architectures.
Abstract
The increasing computational demands of modern AI systems have exposed fundamental limitations of digital hardware, driving interest in alternative paradigms for efficient large-scale inference. Dense Associative Memory (DenseAM) is a family of models that offers a flexible framework for representing many contemporary neural architectures, such as transformers and diffusion models, by casting them as dynamical systems evolving on an energy landscape. In this work, we propose a general method for building analog accelerators for DenseAMs and implementing them using electronic RC circuits, crossbar arrays, and amplifiers. We find that our analog DenseAM hardware performs inference in constant time independent of model size. This result highlights an asymptotic advantage of analog DenseAMs over digital numerical solvers that scale at least linearly with the model size. We consider three settings of progressively increasing complexity: XOR, the Hamming (7,4) code, and a simple language model defined on binary variables. We propose analog implementations of these three models and analyze the scaling of inference time, energy consumption, and hardware. Finally, we estimate lower bounds on the achievable time constants imposed by amplifier specifications, suggesting that even conservative existing analog technology can enable inference times on the order of tens to hundreds of nanoseconds. By harnessing the intrinsic parallelism and continuous-time operation of analog circuits, our DenseAM-based accelerator design offers a new avenue for fast and scalable AI hardware.
