Table of Contents
Fetching ...

Weak Poincaré Inequalities, Simulated Annealing, and Sampling from Spherical Spin Glasses

Brice Huang, Sidhanth Mohanty, Amit Rajaraman, David X. Wu

Abstract

There has been a recent surge of powerful tools to show rapid mixing of Markov chains, via functional inequalities such as Poincaré inequalities. In many situations, Markov chains fail to mix rapidly from a worst-case initialization, yet are expected to approximately sample from a random initialization. For example, this occurs if the target distribution has metastable states, small clusters accounting for a vanishing fraction of the mass that are essentially disconnected from the bulk of the measure. Under such conditions, a Poincaré inequality cannot hold, necessitating new tools to prove sampling guarantees. We develop a framework to analyze simulated annealing, based on establishing so-called weak Poincaré inequalities. These inequalities imply mixing from a suitably warm start, and simulated annealing provides a way to chain such warm starts together into a sampling algorithm. We further identify a local-to-global principle to prove weak Poincaré inequalities, mirroring the spectral independence and localization schemes frameworks for analyzing mixing times of Markov chains. As our main application, we prove that simulated annealing samples from the Gibbs measure of a spherical spin glass for inverse temperatures up to a natural threshold, matching recent algorithms based on algorithmic stochastic localization. This provides the first Markov chain sampling guarantee that holds beyond the uniqueness threshold for spherical spin glasses, where mixing from a worst-case initialization is provably slow due to the presence of metastable states. As an ingredient in our proof, we prove bounds on the operator norm of the covariance matrix of spherical spin glasses in the full replica-symmetric regime. Additionally, we resolve a question related to sampling using data-based initializations.

Weak Poincaré Inequalities, Simulated Annealing, and Sampling from Spherical Spin Glasses

Abstract

There has been a recent surge of powerful tools to show rapid mixing of Markov chains, via functional inequalities such as Poincaré inequalities. In many situations, Markov chains fail to mix rapidly from a worst-case initialization, yet are expected to approximately sample from a random initialization. For example, this occurs if the target distribution has metastable states, small clusters accounting for a vanishing fraction of the mass that are essentially disconnected from the bulk of the measure. Under such conditions, a Poincaré inequality cannot hold, necessitating new tools to prove sampling guarantees. We develop a framework to analyze simulated annealing, based on establishing so-called weak Poincaré inequalities. These inequalities imply mixing from a suitably warm start, and simulated annealing provides a way to chain such warm starts together into a sampling algorithm. We further identify a local-to-global principle to prove weak Poincaré inequalities, mirroring the spectral independence and localization schemes frameworks for analyzing mixing times of Markov chains. As our main application, we prove that simulated annealing samples from the Gibbs measure of a spherical spin glass for inverse temperatures up to a natural threshold, matching recent algorithms based on algorithmic stochastic localization. This provides the first Markov chain sampling guarantee that holds beyond the uniqueness threshold for spherical spin glasses, where mixing from a worst-case initialization is provably slow due to the presence of metastable states. As an ingredient in our proof, we prove bounds on the operator norm of the covariance matrix of spherical spin glasses in the full replica-symmetric regime. Additionally, we resolve a question related to sampling using data-based initializations.

Paper Structure

This paper contains 32 sections, 72 theorems, 417 equations.

Key Result

Theorem 1.1

For any choice of $\gamma_2,\dots,\gamma_{p_*}$, there is a threshold $\beta_{\mathrm{SL}} \leqslant \beta_{\mathrm{shatter}}$ such that for any $\beta < \beta_{\mathrm{SL}}$, with probability at least $1-e^{-\Omega(N^{1/5})}$ over the randomness of $H$, annealed Langevin diffusion run for $\mathrm{

Theorems & Definitions (164)

  • Theorem 1.1: Informal
  • Remark 1.2
  • Theorem 1.3: Informal, see \ref{['th:annealing']}
  • Theorem 1.4: AL20ALO21
  • Theorem 1.5: Special case of \ref{['lem:stopped-scheme-to-weak-poi', 'rem:other-ls-to-weak-poi']}
  • Remark 1.6
  • Remark 1.7
  • Theorem 1.8: See \ref{['thm:main-p-spin']}
  • Theorem 1.9: Informal, see \ref{['thm:partition-fn-and-covariance']}
  • Theorem 1.10: Informal, see \ref{['th:approx-pi-main']}
  • ...and 154 more