An algorithm for atom-centered lossy compression of the atomic orbital basis in density functional theory calculations
Anthony O. Lara, Justin J. Talbot, Zhe Wang, Martin Head-Gordon
TL;DR
This paper tackles the computational burden of approaching the complete basis set limit in DFT by introducing an atom-centered compression scheme based on natural atomic orbitals (NAOs) derived from atomic blocks of the density matrix in a one-center orthogonalized representation. NAOs provide a physically meaningful, localized basis, enabling a single occupation-threshold to prune insignificantly occupied functions while preserving accuracy; in HF tests with a QZ pc-3 basis, thresholds around $10^{-5}$ yield compression factors of $2.5$–$4.5$ and energy errors typically below $0.1$ kcal/mol, with tighter thresholds ($10^{-7}$) achieving errors near $0.01$ kcal/mol and compression around $2$–$2.5$. The results show that larger basis sets yield greater compressibility, that relative energies can be robust to compression due to error cancellation in many cases, and that the method provides a practical pathway to accelerate SCF calculations in large-basis regimes, with future work on dual-basis corrections and exploiting compression to speed up linear-algebra and RI-based steps. Overall, the work offers a proof of concept for controllable AO-basis compression that preserves essential electronic structure information while enabling substantial reductions in basis size and computational effort.
Abstract
Large atomic-orbital (AO) basis sets of at least triple and preferably quadruple-zeta (QZ) size are required to adequately converge Kohn-Sham density functional theory (DFT) calculations towards the complete basis set limit. However, incrementing the cardinal number by one nearly doubles the AO basis dimension, and the computational cost scales as the cube of the AO dimension, so this is very computationally demanding. In this work, we develop and test a natural atomic orbital (NAO) scheme in which the NAOs are obtained as eigenfunctions of atomic blocks of the density matrix in a one-center orthogonalized representation. The NAO representation enables one-center compression of the AO basis in a manner that is optimal for a given threshold, by discarding NAOs with occupation numbers below that threshold. Extensive tests using the Hartree-Fock functional suggest that a threshold of $10^{-5}$ can yield a compression factor (ratio of AO to compressed NAO dimension) between 2.5 and 4.5 for the QZ pc-3 basis. The errors in relative energies are typically less than 0.1 kcal/mol when the compressed basis is used instead of the uncompressed basis. Between 10 and 100 times smaller errors (i.e., usually less than 0.01 kcal/mol) can be obtained with a threshold $10^{-7}$, while the compression factor is typically between 2 and 2.5.
