Modern Methods in Associative Memory
Dmitry Krotov, Benjamin Hoover, Parikshit Ram, Bao Pham
TL;DR
This paper surveys modern energy-based associative memories (AMs), with a focus on DenseAMs that generalize Hopfield networks to dramatically increase storage capacity. It introduces HAMUX, a modular energy framework that decomposes AMs into neuron-layer and hypersynapse components, enabling deep, hierarchical architectures and energy-based analogs of transformer blocks (Energy Transformer). The work connects AM dynamics to diffusion models, showing that diffusion can be interpreted as AM-like memory recall in the small-data regime and as generative modeling in the large-data regime, providing a unifying view of memory, memorization, and generalization. It also situates AMs within broader machine learning practice, illustrating parametric vs nonparametric formulations, supervised and clustering tasks, and kernel-based interpretations, with practical notebooks for hands-on exploration.
Abstract
Associative Memories like the famous Hopfield Networks are elegant models for describing fully recurrent neural networks whose fundamental job is to store and retrieve information. In the past few years they experienced a surge of interest due to novel theoretical results pertaining to their information storage capabilities, and their relationship with SOTA AI architectures, such as Transformers and Diffusion Models. These connections open up possibilities for interpreting the computation of traditional AI networks through the theoretical lens of Associative Memories. Additionally, novel Lagrangian formulations of these networks make it possible to design powerful distributed models that learn useful representations and inform the design of novel architectures. This tutorial provides an approachable introduction to Associative Memories, emphasizing the modern language and methods used in this area of research, with practical hands-on mathematical derivations and coding notebooks.
