Convergence of variational Monte Carlo simulation and scale-invariant pre-training
Nilin Abrahamsen, Zhiyan Ding, Gil Goldshlager, Lin Lin
TL;DR
The paper addresses convergence for variational Monte Carlo applied to neural-network wave functions in electronic structure by analyzing both energy minimization and scale-invariant supervised pre-training. It leverages the scale-invariant Rayleigh quotient and introduces a directionally unbiased gradient estimator to prove convergence bounds for SGD-like updates with MCMC sampling. A scale-invariant loss is proposed for pre-training, with theoretical guarantees mirroring nonconvex SGD rates, and numerical experiments demonstrate faster pre-training and plausible VMC convergence on small, strongly correlated systems. The results suggest scalable, principled guidance for optimizing neural quantum states and point toward extensions to alternative optimization schemes and manifold-based formulations.
Abstract
We provide theoretical convergence bounds for the variational Monte Carlo (VMC) method as applied to optimize neural network wave functions for the electronic structure problem. We study both the energy minimization phase and the supervised pre-training phase that is commonly used prior to energy minimization. For the energy minimization phase, the standard algorithm is scale-invariant by design, and we provide a proof of convergence for this algorithm without modifications. The pre-training stage typically does not feature such scale-invariance. We propose using a scale-invariant loss for the pretraining phase and demonstrate empirically that it leads to faster pre-training.
