Memory in Plain Sight: Surveying the Uncanny Resemblances of Associative Memories and Diffusion Models
Benjamin Hoover, Hendrik Strobelt, Dmitry Krotov, Judy Hoffman, Zsolt Kira, Duen Horng Chau
TL;DR
This paper provides a unified, accessible perspective that treats Diffusion Models as a form of energy-based memory retrieval akin to Associative Memories, contrasting their lack of guaranteed Lyapunov stability with AMs that possess explicit Lyapunov functions. It articulates precise mathematical connections via score functions, energy landscapes, and continuous-time dynamics (PF-ODEs), and highlights both convergences and important differences, such as the role of noise schedules and fixed-point guarantees. The authors survey foundational AM architectures (Hopfield nets, DAM/MHN, HAM) and show how DM dynamics can be interpreted through AM lenses, while identifying practical implications and opportunities for cross-pollination, including energy-constrained DM variants and AM-inspired architectures like Energy Transformers. The work concludes with open challenges, proposing directions for extending AM theory to large-scale diffusion pipelines and for adopting diffusion-inspired memory insights to advance AM research, with potential impact on scaling laws and memory capacity understanding in AI systems.
Abstract
The generative process of Diffusion Models (DMs) has recently set state-of-the-art on many AI generation benchmarks. Though the generative process is traditionally understood as an "iterative denoiser", there is no universally accepted language to describe it. We introduce a novel perspective to describe DMs using the mathematical language of memory retrieval from the field of energy-based Associative Memories (AMs), making efforts to keep our presentation approachable to newcomers to both of these fields. Unifying these two fields provides insight that DMs can be seen as a particular kind of AM where Lyapunov stability guarantees are bypassed by intelligently engineering the dynamics (i.e., the noise and step size schedules) of the denoising process. Finally, we present a growing body of evidence that records DMs exhibiting empirical behavior we would expect from AMs, and conclude by discussing research opportunities that are revealed by understanding DMs as a form of energy-based memory.
