Generative diffusion for perceptron problems: statistical physics analysis and efficient algorithms
Elizaveta Demyanenko, Davide Straziota, Carlo Baldassi, Carlo Lucibello
TL;DR
This work develops a replica-informed diffusion framework (Algorithmic Stochastic Localization, ASL) to assess and enhance sampling from high-dimensional, unnormalized densities, focusing on random perceptron problems. By applying a double replica trick, the authors derive time-dependent free entropy φ_t and identify algorithmic thresholds for successful sampling with AMP-driven diffusion. They show that the spherical perceptron can be efficiently sampled in large RS regions, while the uniform sampling of the binary perceptron is inherently hard; introducing a diverging potential U(s) = −log(s) enables efficient tilted sampling and, coupled with tau-annealed MCMC, provides a robust practical sampler. The results illuminate how the geometry of the solution space (RS vs. RSB) governs sampler performance and suggest concrete algorithmic strategies to tackle hard constraint-satisfaction problems beyond perceptrons.
Abstract
We consider random instances of non-convex perceptron problems in the high-dimensional limit of a large number of examples $M$ and weights $N$, with finite load $α= M/N$. We develop a formalism based on replica theory to predict the fundamental limits of efficiently sampling the solution space using generative diffusion algorithms, conjectured to be saturated when the score function is provided by Approximate Message Passing. For the spherical perceptron with negative margin $κ$, we find that the uniform distribution over solutions can be efficiently sampled in most of the Replica Symmetric region of the $α$-$κ$ plane. In contrast, for binary weights, sampling from the uniform distribution remains intractable. A theoretical analysis of this obstruction leads us to identify a potential $U(s) = -\log(s)$, under which the corresponding tilted distribution becomes efficiently samplable via diffusion. Moreover, we show numerically that an annealing procedure over the shape of this potential yields a fast and robust Markov Chain Monte Carlo algorithm for sampling the solution space of the binary perceptron.
