Algorithmic differentiation for plane-wave DFT: materials design, error control and learning model parameters
Niklas Frederik Schmitz, Bruno Ploumhans, Michael F. Herbst
TL;DR
The paper introduces AD-DFPT, a framework that integrates forward-mode algorithmic differentiation with density-functional perturbation theory to compute end-to-end derivatives in plane-wave DFT workflows. Implemented in the DFTK package, it enables gradient-based exploration over a wide range of input parameters, from geometry and pseudopotentials to XC functional parameters, while leveraging a custom DFPT solve for the SCF response. The authors demonstrate multiple high-impact applications, including elasticity with minimal manual effort, inverse materials design, XC functional learning, pseudopotential optimization, propagation of XC uncertainty, and plane-wave basis error estimation, highlighting the potential for gradient-driven design, uncertainty quantification, and differentiable materials design. While forward-mode AD provides strong end-to-end derivatives, challenges remain in extending to reverse-mode AD and handling more complex functionals, spin polarization, and symmetry perturbations, pointing to future work that could further broaden differentiable plane-wave DFT.
Abstract
We present a differentiation framework for plane-wave density-functional theory (DFT) that combines the strengths of forward-mode algorithmic differentiation (AD) and density-functional perturbation theory (DFPT). In the resulting AD-DFPT framework derivatives of any DFT output quantity with respect to any input parameter (e.g. geometry, density functional or pseudopotential) can be computed accurately without deriving gradient expressions by hand. We implement AD-DFPT into the Density-Functional ToolKit (DFTK) and show its broad applicability. Amongst others we consider the inverse design of a semiconductor band gap, the learning of exchange-correlation functional parameters, or the propagation of DFT parameter uncertainties to relaxed structures. These examples demonstrate a number of promising research avenues opened by gradient-driven workflows in first-principles materials modeling.
