The Interoperability Challenge in DFT Workflows Across Implementations
S. K. Steensen, T. S. Thakur, M. Dillenz, J. M. Carlsson, C. R. C. Rego, E. Flores, H. Hajiyani, F. Hanke, J. M. G. Lastra, W. Wenzel, N. Marzari, T. Vegge, G. Pizzi, I. E. Castelli
TL;DR
The paper presents a practical framework for interoperable DFT workflows by introducing a universal JSON/YAML input–output standard that can be translated by different workflow managers into engine-specific inputs. It demonstrates engine-agnostic execution of an open-circuit voltage workflow across CASTEP, GPAW, Quantum ESPRESSO, and VASP and analyzes how to reconcile energetics from different engines, particularly for non-pristine, vacancy-containing structures. Key findings show that pristine-cell OCVs align across codes (often within a few hundredths of a volt), but vacancy-related energetics are sensitive to smearing, relaxation procedures, and pseudopotential choices, requiring cross-code validation and robust workflow design. The work culminates in design principles for robust automated DFT workflows, the semantic JSON-LD description of workflows to improve FAIRness, and provides code/data availability to enable reproducible cross-code benchmarking and scalable MAP integration.
Abstract
Interoperability and cross-validation remains a significant challenge in the computational materials discovery community. In this context, we introduce a common input/output standard designed for internal translation by various workflow managers (AiiDA, PerQueue, Pipeline Pilot, and SimStack) to produce results in a unified schema. This standard aims to enable engine-agnostic workflow execution across multiple density functional theory (DFT) codes, including CASTEP, GPAW, Quantum ESPRESSO, and VASP. As a demonstration, we have implemented a workflow to calculate the open-circuit voltage across several battery cathode materials using the proposed universal input/output schema. We analyze and resolve the challenges of reconciling energetics computed by different DFT engines and document the code-specific idiosyncrasies that make straightforward comparisons difficult. Motivated by these challenges, we outline general design principles for robust automated DFT workflows. This work represents a practical step towards more reproducible and interoperable workflows for high-throughput materials screening, while highlighting challenges of aligning electronic properties, especially for non-pristine structures.
