Gold-Standard Chemical Database 137 (GSCDB137): A diverse set of accurate energy differences for assessing and developing density functionals
Jiashu Liang, Martin Head-Gordon
TL;DR
GSCDB137 addresses the need for a gold-standard, diverse benchmark for density functional development by assembling $137$ data sets ($8{,}377$ entries) that span main-group and transition-metal chemistry, noncovalent interactions, and molecular properties including dipoles, polarizabilities, oriented electric-field energies, and vibrational frequencies. It updates and reconciles legacy GMTKN55/MGCDB84 references, prunes spin-contaminated points, and adds new TM- and property-focused data, all with transparent references and supplemental material to enable reproducibility. A broad DFT benchmark across $29$ functionals confirms the general Jacob’s ladder trend, with double hybrids offering the best accuracy but requiring careful handling of basis-set, frozen-core, and spin-symmetry issues; category-specific deviations (notably electric-field and frequency properties) highlight the need to incorporate response quantities into functional design. The database provides practical guidance for practitioners—favoring specific functionals for main-group and TM chemistry, and adopting robust strategies for double hybrids—while serving as a versatile training ground for future non-empirical and ML-based functionals. All data and analysis tools are openly available at the project GitHub repository to support ongoing methodological advancement in computational chemistry.
Abstract
We present GSCDB137, a rigorously curated benchmark library of 137 data sets (8377 entries) covering main-group and transition-metal reaction energies and barrier heights, (intramolecular) non-covalent interactions, dipole moments, polarizabilities, electric-field response energies, and vibrational frequencies. Legacy data from GMTKN55 and MGCDB84 have been updated to today's best reference values; redundant, spin-contaminated, or low-quality points were removed, and many new, property-focused sets were added. Testing 29 popular density functional approximations (DFAs) confirms the expected Jacob's-ladder hierarchy overall but also reveals notable exceptions: functional performance for frequencies and electric-field properties correlates poorly with that for other ground-state energetics. ωB97M-V and ωB97X-V are the most balanced hybrid meta-GGA and hybrid GGA, respectively; B97M-V and revPBE-D4 lead the meta-GGA and GGA classes. Double hybrids lower mean errors by about 25 % versus the best hybrids but demand careful frozen-core, basis-set, and multi-reference treatment. GSCDB137 offers a comprehensive, openly documented platform for stringent DFA validation and for training the next generation of non-empirical and machine-learned functionals.
