Scalable Quantum Monte Carlo Method for Polariton Chemistry via Mixed Block Sparsity and Tensor Hypercontraction Method
Yu Zhang
TL;DR
This work addresses the computational bottleneck of exchange-energy evaluation in AFQMC when applied to polaritonic chemistry and large molecular ensembles by introducing a mixed block-sparsity and tensor hypercontraction (BS-THC) representation of Cholesky tensors. By exploiting block sparsity from spatial locality and the low-rank nature of most Cholesky blocks, the authors route high-rank blocks to a block-sparse path and compress genuinely low-rank blocks with THC, achieving an overall $O(N^3)$ scaling and memory near $O(N^2)$ while preserving AFQMC accuracy. Numerical benchmarks on 1D, 2D, and 3D molecular ensembles (up to ~1,200 orbitals) reveal linear growth in the number of Cholesky-tensor nonzeros with system size and sublinear growth of the average rank, along with pronounced rank heterogeneity that motivates the mixed BS-THC strategy. The results enable scalable, predictive AFQMC simulations of cavity-modified chemistry and strongly correlated polaritonic matter, with potential extensions to other ERI-dominated methods such as CCSD. This mixed approach thus provides a practical route to cubic-scaling exchange-energy evaluation in large quantum ensembles.
Abstract
We present a reduced-scaling auxiliary-field quantum Monte Carlo (AFQMC) framework designed for large molecular systems and ensembles, with or without coupling to optical cavities. Our approach leverages the natural block sparsity of the Cholesky decomposition (CD) of the electron repulsion integrals in molecular ensembles and employs tensor hypercontraction (THC) to efficiently compress low-rank Cholesky blocks. By representing the Cholesky vectors in a mixed format, keeping high-rank blocks in block-sparse form and compressing low-rank blocks with THC, we reduce the scaling of exchange-energy evaluation from quartic to robust cubic in the number of molecular orbitals, while lowering memory from cubic toward quadratic. Benchmark analyses on one-, two-, and three-dimensional molecular ensembles (up to ~1,200 orbitals) show that (a) the number of nonzeros in the Cholesky tensors grows linearly with system size across dimensions; (b) the average numerical rank increases sublinearly and does not saturate at these sizes; and (c) the blocks exhibit pronounced rank heterogeneity, with some nearly full rank and many low rank, which naturally motivates the proposed mixed block-sparsity and THC scheme for efficient evaluation of the exchange energy. We demonstrate that the mixed scheme yields cubic CPU-time scaling with favorable prefactors and preserves AFQMC accuracy.
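
As a rough illustration of the rank-based routing described in the abstract, the following NumPy sketch keeps a block dense when it is close to full rank and stores low-rank factors otherwise. It rests on several assumptions that are not taken from the paper: a "block" is modeled as a plain dense matrix, a truncated SVD stands in for THC (which is a different factorization), and the function names, the tolerance `tol`, and the threshold `max_rank_fraction` are hypothetical.

```python
import numpy as np


def numerical_rank(block, tol=1e-8):
    """Count singular values larger than tol times the largest one."""
    s = np.linalg.svd(block, compute_uv=False)
    if s.size == 0 or s[0] == 0.0:
        return 0
    return int(np.sum(s > tol * s[0]))


def compress_or_keep(block, max_rank_fraction=0.5, tol=1e-8):
    """Route one block: keep it dense if near full rank, else store factors.

    Assumption: truncated SVD is used here as a simple stand-in for THC.
    """
    u, s, vt = np.linalg.svd(block, full_matrices=False)
    rank = int(np.sum(s > tol * s[0])) if s.size and s[0] > 0.0 else 0
    if rank > max_rank_fraction * min(block.shape):
        return ("dense", block)
    # Fold the singular values into the left factor: block ~ left @ right.
    return ("lowrank", (u[:, :rank] * s[:rank], vt[:rank]))


def contract(stored, g):
    """Evaluate sum_pq B_pq * G_pq for a stored block B and a matrix G."""
    kind, data = stored
    if kind == "dense":
        return float(np.sum(data * g))
    left, right = data
    # sum_pq (left @ right)_pq G_pq = sum_kq (left^T G)_kq right_kq,
    # so the full block is never rebuilt on the low-rank path.
    return float(np.sum((left.T @ g) * right))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    full_block = rng.standard_normal((8, 8))
    low_block = rng.standard_normal((8, 2)) @ rng.standard_normal((2, 8))
    g = rng.standard_normal((8, 8))
    for name, block in [("full", full_block), ("low", low_block)]:
        stored = compress_or_keep(block)
        error = abs(contract(stored, g) - float(np.sum(block * g)))
        print(name, stored[0], numerical_rank(block), error < 1e-10)
```

The contraction here is a generic elementwise trace, not the AFQMC exchange-energy expression; it only shows that both storage paths return the same number while the low-rank path works with the factors alone.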
