Physics- and data-driven Active Learning of neural network representations for free energy functions of materials from statistical mechanics
Jamie Holber, Krishna Garikipati
TL;DR
The paper tackles the challenge of building accurate, atomistically informed free energy representations for materials by learning a differentiable surrogate of the free energy density $g(\vec{\eta})$ from derivative data $\vec{\mu}=\partial g/\partial \vec{\eta}$ obtained via DFT-informed Monte Carlo. It integrates an Integrable Deep Neural Network (IDNN) with symmetry-invariant features to ensure physically consistent energy landscapes, and couples this with a versatile Active Learning workflow that blends global space-filling sampling, physics-driven targeted sampling (wells, endpoints, regions of instability, and lowest-energy paths), and uncertainty-based criteria (High Error, Highly Sensitive Regions, QBC) plus novelty and dynamic hyperparameter reweighting. The framework demonstrates how different sampling strategies impact global and regional accuracy, showing that combinations like 7D plus limited 2D sampling and QBC or novelty-enhanced approaches can substantially reduce the required data while preserving fidelity in physically relevant regions. The work offers a robust, scalable methodology for Monte Carlo sampling of high-dimensional free energy landscapes, with direct applicability to phase-field modeling and materials design. The practical impact lies in enabling efficient, principled generation of atomistically informed free energy models for complex materials systems, reducing computational cost and improving predictive reliability in phase dynamics simulations.
Abstract
Accurate free energy representations are crucial for understanding phase dynamics in materials. We employ a scale-bridging approach to incorporate atomistic information into our free energy model by training a neural network on DFT-informed Monte Carlo data. To optimize sampling in the high-dimensional Monte Carlo space, we present an Active Learning framework that integrates space-filling sampling, uncertainty-based sampling, and physics-informed sampling. Additionally, our approach includes methods such as hyperparameter tuning, dynamic sampling, and novelty enforcement. These strategies can be combined to reduce MSE,either globally or in targeted regions of interest,while minimizing the number of required data points. The framework introduced here is broadly applicable to Monte Carlo sampling of a range of materials systems.
