Knowledge as a Breaking of Ergodicity

Yang He; Vassiliy Lubchenko

Knowledge as a Breaking of Ergodicity

Yang He, Vassiliy Lubchenko

TL;DR

The work reframes knowledge acquisition from binary data as a thermodynamic problem, introducing a high-order Ising-like energy $E(oldsymbol{\sigma})$ whose couplings $\mathbf{J}$ are learned from dataset weights via $J = -2^{-N}\sum_i \boldsymbol{\sigma}_i E_i$, and retrieval proceeds with Gibbs sampling at temperature $T$. It constructs a conjoint free-energy surface $A({E_i},T)$ and a Legendre-transformed version over coarse-grained weights $x_i$, linking learning and retrieval to minimization on a thermodynamic landscape and highlighting a Gibbs-inequality bound. The central finding is that reducing description to a smaller set of couplings induces multiple free-energy minima, fracturing the configuration space into ergodic subspaces and creating kinetic bottlenecks that complicate learning and retrieval; this ergodicity breaking is analogous to phase coexistence and requires remedies such as parameterizing non-represented (unseen) states with an extensive energy gap and possibly deploying multiple expert models for distinct minima. These insights connect to broader themes in physics-inspired inference, inform strategies for robust, context-aware knowledge libraries, and suggest practical parallels to force-field design and protein-folding-like landscape funneling in complex systems.

Abstract

We construct a thermodynamic potential that can guide training of a generative model defined on a set of binary degrees of freedom. We argue that upon reduction in description, so as to make the generative model computationally-manageable, the potential develops multiple minima. This is mirrored by the emergence of multiple minima in the free energy proper of the generative model itself. The variety of training samples that employ N binary degrees of freedom is ordinarily much lower than the size 2^N of the full phase space. The non-represented configurations, we argue, should be thought of as comprising a high-temperature phase separated by an extensive energy gap from the configurations composing the training set. Thus, training amounts to sampling a free energy surface in the form of a library of distinct bound states, each of which breaks ergodicity. The ergodicity breaking prevents escape into the near continuum of states comprising the high-temperature phase; thus it is necessary for proper functionality. It may however have the side effect of limiting access to patterns that were underrepresented in the training set. At the same time, the ergodicity breaking within the library complicates both learning and retrieval. As a remedy, one may concurrently employ multiple generative models -- up to one model per free energy minimum.

Knowledge as a Breaking of Ergodicity

TL;DR

The work reframes knowledge acquisition from binary data as a thermodynamic problem, introducing a high-order Ising-like energy

whose couplings

are learned from dataset weights via

, and retrieval proceeds with Gibbs sampling at temperature

. It constructs a conjoint free-energy surface

and a Legendre-transformed version over coarse-grained weights

, linking learning and retrieval to minimization on a thermodynamic landscape and highlighting a Gibbs-inequality bound. The central finding is that reducing description to a smaller set of couplings induces multiple free-energy minima, fracturing the configuration space into ergodic subspaces and creating kinetic bottlenecks that complicate learning and retrieval; this ergodicity breaking is analogous to phase coexistence and requires remedies such as parameterizing non-represented (unseen) states with an extensive energy gap and possibly deploying multiple expert models for distinct minima. These insights connect to broader themes in physics-inspired inference, inform strategies for robust, context-aware knowledge libraries, and suggest practical parallels to force-field design and protein-folding-like landscape funneling in complex systems.

Knowledge as a Breaking of Ergodicity

TL;DR

Abstract

Knowledge as a Breaking of Ergodicity

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)