A fully differentiable framework for training proxy Exchange Correlation Functionals for periodic systems
Rakshit Kumar Singh, Aryan Amit Barsainyan, Bharath Ramsundar
TL;DR
This work tackles the computational bottleneck of DFT for periodic systems by introducing a fully differentiable framework that integrates neural network exchange–correlation functionals into KS-DFT for solids, with gradients flowing through the SCF loop. The approach is implemented in Python with a PyTorch backend and integrated into DeepChem, enabling end-to-end learning via a drop-in XC functional API and a HybridXC formulation that blends ML with conventional GGA physics. Benchmarks on Al, Ni, and Ca show 5–10% relative errors relative to established codes, while graphene highlights current limitations and the need for improved basis representations. The framework advances the ability to train ML-based XC functionals in periodic systems, offering a scalable path toward accurate, differentiable, and physically grounded solids simulations, with plane-wave extensions proposed for future work.
Abstract
Density Functional Theory (DFT) is widely used for first-principles simulations in chemistry and materials science, but its computational cost remains a key limitation for large systems. Motivated by recent advances in ML-based exchange-correlation (XC) functionals, this paper introduces a differentiable framework that integrates machine learning models into density functional theory (DFT) for solids and other periodic systems. The framework defines a clean API for neural network models that can act as drop in replacements for conventional exchange-correlation (XC) functionals and enables gradients to flow through the full self-consistent DFT workflow. The framework is implemented in Python using a PyTorch backend, making it fully differentiable and easy to use with standard deep learning tools. We integrate the implementation with the DeepChem library to promote the reuse of established models and to lower the barrier for experimentation. In initial benchmarks against established electronic structure packages (GPAW and PySCF), our models achieve relative errors on the order of 5-10%.
