Saddle Hierarchy in Dense Associative Memory
Robin Thériault, Daniele Tantari
TL;DR
This work develops a Dense Associative Memory (DAM) framework built as a three-layer Boltzmann machine with Potts hidden units to study stationary points in both real-data training and teacher-student data generation. The authors derive saddle-point equations that characterize DAMs trained on real data and those trained in a teacher-student setting, then introduce an effective loss with a regularization parameter to stabilize training via β_eff = varsigma(2 upsilon) β. They show a saddle-point hierarchy in which memories learned by narrower DAMs appear as saddles in wider DAMs, enabling a network-growth strategy via splitting steepest descent that significantly reduces training cost, often with runtime scaling near log P_max. Empirically, the DAM learns interpretable prototypes (w^mu) with meaningful soft labels (p^gamma_y) and achieves high classification fidelity, including ~98% accuracy with a 1-NN classifier on memories, while splitting-based training matches accuracy with far fewer computations on MNIST-like data. The results connect energy-based DAMs to modern mechanisms like attention and diffusion constructs, offering a principled path to scalable, interpretable, and robust pattern classification.
Abstract
Dense Associative Memory (DAM) models have been attracting renewed attention since they were shown to be robust to adversarial examples and closely related to cutting edge machine learning paradigms, such as the attention mechanism and generative diffusion. We study a DAM built upon a three-layer Boltzmann machine with Potts hidden units, which represent data clusters and classes. Through a statistical mechanics analysis, we derive saddle-point equations that characterize both the stationary points of DAMs trained on real data and the fixed points of DAMs trained on synthetic data within a teacher-student framework. Based on these results, we propose a novel regularization scheme that makes training significantly more stable. Moreover, we show empirically that our DAM learns interpretable solutions to both supervised and unsupervised classification problems. Pushing our theoretical analysis further, we find that the weights learned by relatively small DAMs correspond to unstable saddle points in larger DAMs. We implement a network-growing algorithm that leverages this saddle-point hierarchy to drastically reduce the computational cost of training dense associative memory.
