Dense associative memory for Gaussian distributions
Chandan Tankala, Krishnakumar Balasubramanian
TL;DR
This work extends dense associative memories to operate on Gaussian distributions by endowing the space of Gaussians with the Bures–Wasserstein metric and defining a log-sum-exp energy over stored distributions. Retrieval is performed via Gibbs-weighted aggregation of optimal transport maps, with fixed points corresponding to Wasserstein barycenters, enabling distributional recall. The authors prove exponential storage capacity and retrieval guarantees under Wasserstein perturbations and validate BW-DAM on synthetic data and real-world Gaussian embeddings for images, text, and sentences, demonstrating superior robustness and accuracy to Euclidean baselines. Overall, the framework bridges classical DAMs with modern distributional representations, opening avenues for uncertainty-aware memory and probabilistic reasoning in memory-augmented learning.
Abstract
Dense associative memories (DAMs) store and retrieve patterns via energy-function based fixed points, but existing models are limited to vector representations. We extend DAMs to Gaussian densities equipped with the 2-Wasserstein distance. Our framework defines a log-sum-exp energy over stored distributions and a retrieval dynamics aggregating optimal transport maps in a Gibbs-weighted manner. Stationary points correspond to self-consistent Wasserstein barycenters, generalizing classical DAM fixed points. We prove exponential storage capacity and provide quantitative retrieval guarantees under Wasserstein perturbations. We validate the method on synthetic and real-world image (CelebA and CIFAR-10 datasets) and text (text8 and NLI corpus) datasets. By generalizing from vectors to distributions, our work bridges classical DAMs with modern generative modeling and paves way for distributional storage and retrieval in memory-augmented learning.
