DeFecT-FF: Accelerated Modeling of Defects in Cd-Zn--Te-Se-S Compounds Combining High-Throughput DFT and Machine Learning Force Fields
Md Habibur Rahman, Arun Mannodi-Kanakkithodi
TL;DR
The paper addresses the computational bottleneck of mapping defect landscapes in Cd/Zn–Te/Se/S alloys essential for CdTe solar cells. It introduces DeFecT-FF, which fuses high-throughput DFT (GGA-PBE and hybrid HSE06+SOC) with crystal-graph ML force fields (M3GNet ALIGNN-based MLFFs) trained to hybrid-functional accuracy, guided by active learning and ShakeNBreak sampling. The authors deliver the largest unified HSE06 defect dataset across Cd/Zn–Te/Se/S compositions and present a workflow and nanoHUB tool that enables defect enumeration, MLFF optimization, and defect formation energy diagrams as functions of the Fermi level $E_F$ and chemical potentials, greatly reducing the need for expensive DFT relaxations. These advances yield near-DFT accuracy with substantial speedups (e.g., multi-hour DFT relaxations reduced to minutes) and enable rapid, charge-aware defect surveys to guide alloying and doping strategies for CdSeTe solar cells, ultimately aiming to close the voltage deficit in this photovoltaic platform.
Abstract
We developed DeFecT-FF, a framework for predicting the energies and ground-state configurations of native point defects, extrinsic dopants, impurities, and defect complexes in zincblende-phase Cd/Zn-Te/Se/S compounds relevant to CdTe-based solar cells. The framework combines high-throughput DFT data with crystal graph-based machine learning force fields (MLFFs) trained to reproduce DFT energies and forces. Alloying at Cd or Te sites offers a route to tune the electronic and defect properties of CdTe absorbers for improved solar efficiency. Given the vast number of possible defect types, charge states, and symmetry-breaking configurations, traditional DFT approaches are computationally prohibitive. Our dataset includes GGA-PBE and HSE06-optimized structures for bulk, alloyed, interface, and grain-boundary systems. Using active learning, we expanded the dataset and trained MLFFs to accurately predict energies across charge states. The model enabled rapid screening and discovery of new low-energy defect configurations, validated through HSE06 calculations with spin-orbit coupling. The DeFecT-FF framework is publicly available as a nanoHUB tool, allowing users to upload crystallographic files, automatically generate defects, and compute defect formation energies versus Fermi level and chemical potentials, greatly reducing the need for expensive DFT simulations.
