Neural Field Transformations for Hybrid Monte Carlo: Architectural Design and Scaling
Jinchen He, Xiao-Yong Jin, James C. Osborn, Yong Zhao
TL;DR
The paper addresses critical slowing down in Hybrid Monte Carlo sampling for lattice gauge theories and proposes neural field transformations within the NTHMC framework to reshape the energy landscape via an invertible, gauge-covariant map $\mathcal{F}$, producing an effective action $S_{\mathrm{FT}}(\tilde{U}) = S(\mathcal{F}(\tilde{U})) - \log| \det \mathcal{F}^*(\tilde{U}) |$ for sampling. It systematically analyzes architectural designs (e.g., residual connections, attention, channel-dependent activation) of gauge-covariant CNNs on a 2D $U(1)$ theory, training at $\beta=3$, $V=32^2$, and assessing transfer to larger volumes and finer lattices using autocorrelation $\gamma(\delta)$ and topological-change metrics $\Delta Q$. The results show that wider receptive fields and channel-dependent activations boost sampling efficiency, with the Comb architecture delivering the largest gains (approximately a 40% reduction in $\gamma(16)$) and improved topological tunneling, while maintaining favorable scaling across volume and lattice spacing. These findings provide concrete design guidelines for extending NTHMC to four-dimensional $SU(3)$ gauge theories and large-scale lattice QCD, with future work on integrator choices, training objectives beyond force minimization, and broader training regimes.
Abstract
Critical slowing down, where autocorrelation grows rapidly near the continuum limit due to Hybrid Monte Carlo (HMC) moving through configuration space inefficiently, still challenges lattice gauge theory simulations. Combining neural field transformations with HMC (NTHMC) can reshape the energy landscape and accelerate sampling, but the choice of neural architectures has yet to be studied systematically. We evaluate NTHMC on a two-dimensional U(1) gauge theory, analyzing how it scales and transfers to larger volumes and smaller lattice spacing. Controlled comparisons let us isolate architectural contributions to sampling efficiency. Good designs can reduce autocorrelation and boost topological tunneling while maintaining favorable scaling. More broadly, our study highlights emerging design guides, such as wider receptive fields and channel-dependent activations, paving the way for systematic extensions to four-dimensional SU(3) gauge theory.
