Dense Hopfield Networks in the Teacher-Student Setting
Robin Thériault, Daniele Tantari
TL;DR
The paper analyzes dense $p$-body Hopfield networks in a controlled teacher–student (inverse learning) setting, deriving the phase diagram for unsupervised pattern retrieval and revealing ferromagnetic regimes that correspond to prototype and feature learning. It shows that on the Nishimori line the inverse problem remains replica-symmetric and that the retrieval transition aligns with the direct model’s spin-glass transition, while outside this line larger student $p$ yields extensive noise tolerance and an explicit zero-temperature adversarial-robustness formula. The work provides exact RS results for phase boundaries, clarifies universality connections between direct and inverse problems, and explains why prototype phases exhibit adversarial robustness. These insights illuminate how increasing model capacity and aligning teacher-student parameters can enhance robustness and data efficiency in dense Hopfield networks, with potential implications for understanding modern robust representations and learning architectures.
Abstract
Dense Hopfield networks are known for their feature to prototype transition and adversarial robustness. However, previous theoretical studies have been mostly concerned with their storage capacity. We bridge this gap by studying the phase diagram of p-body Hopfield networks in the teacher-student setting of an unsupervised learning problem, uncovering ferromagnetic phases reminiscent of the prototype and feature learning regimes. On the Nishimori line, we find the critical size of the training set necessary for efficient pattern retrieval. Interestingly, we find that that the paramagnetic to ferromagnetic transition of the teacher-student setting coincides with the paramagnetic to spin-glass transition of the direct model, i.e. with random patterns. Outside of the Nishimori line, we investigate the learning performance in relation to the inference temperature and dataset noise. Moreover, we show that using a larger p for the student than the teacher gives the student an extensive tolerance to noise. We then derive a closed-form expression measuring the adversarial robustness of such a student at zero temperature, corroborating the positive correlation between number of parameters and robustness observed in large neural networks. We also use our model to clarify why the prototype phase of modern Hopfield networks is adversarially robust.
