Leveraging recurrence in neural network wavefunctions for large-scale simulations of Heisenberg antiferromagnets on the square lattice
M. Schuyler Moss, Roeland Wiersema, Mohamed Hibat-Allah, Juan Carrasquilla, Roger G. Melko
TL;DR
This work develops and benchmarks a two-dimensional recurrent neural network variational wavefunction for the square-lattice Heisenberg antiferromagnet, enabling ground-state studies on lattices with more than 1000 spins via iterative retraining. By reusing trained weights across increasing system sizes and enforcing symmetries, the authors perform finite-size scaling to extract thermodynamic-limit observables such as the ground-state energy and sublattice magnetization, achieving close agreement with quantum Monte Carlo and PEPS benchmarks. The study also compares energies and correlation observables, showing that stronger variational energies do not always guarantee improved correlations and highlighting the need to assess both energies and correlations in variational studies. The results demonstrate that RNN-based variational states can capture key ground-state properties in the thermodynamic limit and suggest directions for improving expressiveness and exploring other lattices with scalable train-time strategies.
Abstract
Machine-learning-based variational Monte Carlo simulations are a promising approach for targeting quantum many-body ground states, especially in two dimensions and in cases where the ground state is known to have a non-trivial sign structure. While many state-of-the-art variational energies have been reached with these methods for finite-size systems, little work has been done to use these results to extract information about the target state in the thermodynamic limit. In this work, we employ recurrent neural networks (RNNs) as a variational ansätze, and leverage their recurrent nature to simulate the ground states of progressively larger systems through iterative retraining. This transfer learning technique allows us to simulate spin-$\frac{1}{2}$ systems on lattices with more than 1,000 spins without beginning optimization from scratch for each system size, thus reducing the demands for computational resources. In this study, we focus on the square-lattice antiferromagnetic Heisenberg model, where it is possible to carefully benchmark our results. We show that we are able to systematically improve the accuracy of the results from our simulations by increasing the training time, and obtain results for finite-sized lattices that are in good agreement with the literature values. Furthermore, we use these results to extract accurate estimates of the ground-state properties in the thermodynamic limit. This work demonstrates that RNN wavefunctions are able to extract accurate information about quantum many-body systems in the thermodynamic limit.
