Machine Learning-Driven Insights into Excitonic Effects in 2D Materials
Ahsan Javed, Sajid Ali
TL;DR
The paper tackles the computational bottleneck of determining exciton binding energies in 2D materials by developing a machine learning framework trained on C2DB descriptors to predict EBE and G0W0 band gaps from inexpensive features. Random Forest models deliver strong predictive performance (G0W0 gap R^2 ~ 0.98; EBE R^2 ~ 0.84), with SHAP analysis highlighting the PBE band gap and layer thickness as key drivers. A Bayesian optimization loop using Gaussian process regression and Expected Improvement accelerates the search, efficiently identifying high-EBE monolayers from ~4001 candidates and listing notable candidates in both ACl2-type and TMDC families. The approach offers a rapid, scalable screening pathway for 2D optoelectronic materials and could be extended to three-dimensional systems, enabling broader data-driven materials discovery.
Abstract
Understanding excitonic effects in two-dimensional (2D) materials is critical for advancing their potential in next-generation electronic and photonic devices. In this study, we introduce a machine learning (ML)-based framework to predict exciton binding energies in 2D materials, offering a computationally efficient alternative to traditional methods such as many-body perturbation theory (GW) and the Bethe-Salpeter equation. Leveraging data from the Computational 2D Materials Database (C2DB), our ML models establish connections between cheaply available material descriptors and complex excitonic properties, significantly accelerating the screening process for materials with pronounced excitonic effects. Additionally, Bayesian optimization with Gaussian process regression was employed to efficiently filter materials with largest exciton binding energies, further enhancing the discovery process. Although developed for 2D systems, this approach is versatile and can be extended to three-dimensional materials, broadening its applicability in materials discovery.
