Scaling of Stochastic Normalizing Flows in $\mathrm{SU}(3)$ lattice gauge theory
Andrea Bulgarelli, Elia Cellini, Alessandro Nada
TL;DR
This work demonstrates the first implementation of Stochastic Normalizing Flows for SU(3) lattice gauge theory in four dimensions and provides evidence that SNFs scale with the system's degrees of freedom in the same way as NE-MCMC. By interleaving gauge-equivariant NF layers with out-of-equilibrium MC updates, the authors achieve roughly a twofold improvement in key sampling metrics like the KL divergence and the Effective Sample Size, while keeping training costs modest. The study shows that the performance improvements persist across volumes and lattice spacings, with scaling governed by the ratio $n_{\mathrm{step}}/(L/a)^4$. These results point to SNFs as a scalable and practical approach to mitigate critical slowing down in high-dimensional gauge theories and motivate future work on protocol optimization and more expressive equivariant layers.
Abstract
Non-equilibrium Markov Chain Monte Carlo (NE-MCMC) simulations provide a well-understood framework based on Jarzynski's equality to sample from a target probability distribution. By driving a base probability distribution out of equilibrium, observables are computed without the need to thermalize. If the base distribution is characterized by mild autocorrelations, this approach provides a way to mitigate critical slowing down. Out-of-equilibrium evolutions share the same framework of flow-based approaches and they can be naturally combined into a novel architecture called Stochastic Normalizing Flows (SNFs). In this work we present the first implementation of SNFs for $\mathrm{SU}(3)$ lattice gauge theory in 4 dimensions, defined by introducing gauge-equivariant layers between out-of-equilibrium Monte Carlo updates. The core of our analysis is focused on the promising scaling properties of this architecture with the degrees of freedom of the system, which are directly inherited from NE-MCMC. Finally, we discuss how systematic improvements of this approach can realistically lead to a general and yet efficient sampling strategy at fine lattice spacings for observables affected by long autocorrelation times.
