Entanglement and optimization within autoregressive neural quantum states
Andrew Jreissaty, Hang Zhang, Jairo C. Quijano, Juan Carrasquilla, Roeland Wiersema
TL;DR
This work analyzes how autoregressive neural quantum states—specifically recurrent neural networks and autoregressive transformers—encode entanglement in strongly correlated spin systems. By introducing a square-modulus normalization to replace the conventional softmax, the authors reveal entanglement transitions and spectral statistics that resemble thermal and MBL-like regimes, and show that entanglement can be tuned via hyperparameters such as the hidden/embedding dimension and the Gaussian width of parameter initialization. They construct entanglement phase diagrams, quantify scaling with system size, and connect these properties to variational Monte Carlo performance for finding ground states of models like the TFIM and Heisenberg chain, uncovering optimal initialization strategies. The findings offer a path to more effective ground-state searches and provide insight into the expressive power and phase structure of autoregressive NQS, with implications for simulating highly entangled quantum matter. The work also demonstrates that Circulant attention in ATFs and MOD normalization can preserve or enhance entanglement, informing future design of NQS architectures for quantum many-body problems.
Abstract
Neural quantum states (NQSs) are powerful variational ansätze capable of representing highly entangled quantum many-body wavefunctions. While the average entanglement properties of ensembles of restricted Boltzmann machines are well understood, the entanglement structure of autoregressive NQSs such as recurrent neural networks and transformers remains largely unexplored. We perform large-scale simulations of ensembles of random autoregressive wavefunctions for chains of up to $256$ spins and uncover signatures of transitions in their average entanglement scaling, entanglement spectra, and correlation functions. We show that the standard softmax normalization of the wavefunction suppresses entanglement and fluctuations, and introduce a square modulus normalization function that restores them. Finally, we connect the insights gained from our entanglement and activation function analysis to initialization strategies for finding the ground states of strongly correlated Hamiltonians via variational Monte Carlo.
