Computationally sufficient statistics for Ising models
Abhijith Jayakumar, Shreya Shukla, Marc Vuffray, Andrey Y. Lokhov, Sidhant Misra
TL;DR
This work tackles the problem of learning Ising-model Gibbs distributions when only limited statistics are observable, linking computational tractability to observational power. It introduces a polynomially-approximated gradient for the Interaction Screening Estimator, enabling convex optimization using low-order moments and establishing that statistics of order $O(\gamma)$ suffice for accurate parameter, structure, and magnetic-field recovery, with a sample complexity that scales as $O\big(e^{8\gamma}\gamma^4\log p / \epsilon^4\big)$. The paper also shows that structure can be recovered via thresholding under a separation condition, and that magnetic fields can be learned after fixing the structure; with stronger priors (e.g., known $D$-regular graphs), even lower-order statistics become sufficient. These results reveal a concrete information-computation tradeoff in learning discrete graphical models and provide practical algorithms for scenarios where full configurations are unavailable, with potential extensions to physics-inspired priors and more constrained model classes.
Abstract
Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, computationally efficient algorithms for learning Gibbs distributions rely on access to full sample configurations generated from the model. For many systems of interest that arise in physical contexts, expecting a full sample to be observed is not practical, and hence it is important to look for computationally efficient methods that solve the learning problem with access to only a limited set of statistics. We examine the trade-offs between the power of computation and observation within this scenario, employing the Ising model as a paradigmatic example. We demonstrate that it is feasible to reconstruct the model parameters for a model with $\ell_1$ width $γ$ by observing statistics up to an order of $O(γ)$. This approach allows us to infer the model's structure and also learn its couplings and magnetic fields. We also discuss a setting where prior information about structure of the model is available and show that the learning problem can be solved efficiently with even more limited observational power.
