Attention-Based Foundation Model for Quantum States
Timothy Zaklama, Daniele Guerci, Liang Fu
TL;DR
The paper introduces Q-stage, an attention-based foundation model that learns a smooth map from Hamiltonian parameters to many-body ground-state wavefunctions by tokenizing Fock-basis configurations and conditionally injecting a parameter token. Trained on a small set of exact diagonalization states, the model generalizes across parameter space and reproduces ground-state energies and phase diagrams for the 1D and 2D t–V models with high fidelity, enabling fast inference across the Hamiltonian manifold. A key finding is that a single shared network can interpolate the wavefunction across ($V/t$, $N$) while equal-time observables follow post hoc from the learned wavefunction; limitations include difficulty with first-order transitions and excited states. The work demonstrates a scalable route to universal quantum-state representations that can extend to broader Hamiltonians and continuum settings, potentially enabling unified modeling of quantum matter across diverse systems.
Abstract
We present an attention-based foundation model architecture for learning and predicting quantum states across Hamiltonian parameters, system sizes, and physical systems. Using only basis configurations and physical parameters as inputs, our trained neural network is able to produce highly accurate ground state wavefunctions. For example, we build the phase diagram for the 2D square-lattice $t-V$ model with $N$ particles, from only 18 parameters $(V/t,N)$. Thus, our architecture provides a basis for building a universal foundation model for quantum matter.
