Learning phases with Quantum Monte Carlo simulation cell
Amrita Ghosh, Mugdha Sarkar, Ying-Jer Kao, Pochung Chen
TL;DR
The paper tackles the challenge of extracting phase information from large, high-dimensional QMC data by introducing spin-opstring, a compact representation that combines the initial spin state with the SSE operator string. It demonstrates its effectiveness for phase classification, nonlocal observable regression, and transfer learning across related models and system sizes, while enabling interpretable ML via SHAP and LRP. The results show accurate phase boundaries, compatible predictions of nonlocal observables, substantial memory and compute savings, and robust generalization beyond training domains. The work suggests spin-opstring as a versatile input for ML in quantum many-body physics, with promising avenues for unsupervised learning and generative modeling.
Abstract
We propose the use of the ``spin-opstring", derived from Stochastic Series Expansion Quantum Monte Carlo (QMC) simulations as machine learning (ML) input data. It offers a compact, memory-efficient representation of QMC simulation cells, combining the initial state with an operator string that encodes the state's evolution through imaginary time. Using supervised ML, we demonstrate the input's effectiveness in capturing both conventional and topological phase transitions, and in a regression task to predict non-local observables. We also demonstrate the capability of spin-opstring data in transfer learning by training models on one quantum system and successfully predicting on another, as well as showing that models trained on smaller system sizes generalize well to larger ones. Importantly, we illustrate a clear advantage of spin-opstring over conventional spin configurations in the accurate prediction of a quantum phase transition. Finally, we show how the inherent structure of spin-opstring provides an elegant framework for the interpretability of ML predictions. Using two state-of-the-art interpretability techniques, Layer-wise Relevance Propagation and SHapley Additive exPlanations, we show that the ML models learn and rely on physically meaningful features from the input data. Together, these findings establish the spin-opstring as a broadly-applicable and interpretable input format for ML in quantum many-body physics.
