Accelerating inverse materials design using generative diffusion models with reinforcement learning
Junwu Chen, Jeff Guo, Edvin Fako, Philippe Schwaller
TL;DR
This work introduces MatInvent, an RL-driven framework that fine-tunes pre-trained diffusion models to perform goal-directed crystal generation with multiple, potentially conflicting, property objectives. By modeling denoising steps as a multi-step MDP and applying reward-weighted KL regularization together with experience replay and a diversity filter, MatInvent delivers fast convergence (around 60 iterations) and substantial reductions in property evaluation needs compared with conditional generation approaches. The approach achieves both single-property and multi-property optimization across electronic, magnetic, mechanical, thermal, and dielectric domains, producing SUN structures with target-like properties under tight computational budgets and enabling Pareto optimization for challenging dielectric/magnet design tasks. The method demonstrates broad compatibility with diffusion architectures and suggests promising directions for uncertainty-aware predictors, curriculum-based multi-objective strategies, and integration with automated laboratories for closed-loop material discovery.
Abstract
Diffusion models promise to accelerate material design by directly generating novel structures with desired properties, but existing approaches typically require expensive and substantial labeled data ($>$10,000) and lack adaptability. Here we present MatInvent, a general and efficient reinforcement learning workflow that optimizes diffusion models for goal-directed crystal generation. For single-objective designs, MatInvent rapidly converges to target values within 60 iterations ($\sim$ 1,000 property evaluations) across electronic, magnetic, mechanical, thermal, and physicochemical properties. Furthermore, MatInvent achieves robust optimization in design tasks with multiple conflicting properties, successfully proposing low-supply-chain-risk magnets and high-$κ$ dielectrics. Compared to state-of-the-art methods, MatInvent exhibits superior generation performance under specified property constraints while dramatically reducing the demand for property computation by up to 378-fold. Compatible with diverse diffusion model architectures and property constraints, MatInvent could offer broad applicability in materials discovery.
