Hopfield model with planted patterns: a teacher-student self-supervised learning model
Francesco Alemanno, Luca Camanzi, Gianluca Manzan, Daniele Tantari
TL;DR
This work extends the Hopfield model by embedding planted, correlated patterns within a teacher-student self-supervised learning framework, where the student weights act as patterns and the dataset encodes the planted signal. It shows a transition between memorization and generalization driven by the training set size $M$, dataset noise $\beta$, and the inference temperature $\beta^{-1}$, with a key analytic result on the Nishimori line giving a phase boundary $\beta_c^{-1}=1+\sqrt{\gamma}$ (extensive data $M=\gamma N$). Replica-symmetric analysis reveals nonzero learning order parameters $m$ and $q$ above the critical line, with $m=q$ on the Nishimori line, indicating learning by generalization without spin-glass, while memorization dominates at low $M$ or high noise. The results connect classical memory-capacity phenomena to self-supervised learning, showing how dataset structure and size can enable generalization even when individual examples are weakly informative.
Abstract
While Hopfield networks are known as paradigmatic models for memory storage and retrieval, modern artificial intelligence systems mainly stand on the machine learning paradigm. We show that it is possible to formulate a teacher-student self-supervised learning problem with Boltzmann machines in terms of a suitable generalization of the Hopfield model with structured patterns, where the spin variables are the machine weights and patterns correspond to the training set's examples. We analyze the learning performance by studying the phase diagram in terms of the training set size, the dataset noise and the inference temperature (i.e. the weight regularization). With a small but informative dataset the machine can learn by memorization. With a noisy dataset, an extensive number of examples above a critical threshold is needed. In this regime the memory storage limits of the system becomes an opportunity for the occurrence of a learning regime in which the system can generalize.
