DMFlow: Disordered Materials Generation by Flow Matching
Liming Wu, Rui Jiao, Qi Li, Mingze Li, Songyou Li, Shifeng Jin, Wenbing Huang
TL;DR
DMFlow tackles the generation of disordered crystals by introducing a unified representation for ordered, Substitutional Disorder (SD), and Positional Disorder (PD) alongside a Riemannian flow matching framework that enforces simplex constraints via spherical reparameterization. A novel Velocity Prediction Network (GNN) processes continuous disorder inputs and multi-position interactions to produce physically valid generation trajectories for lattice parameters, fractional coordinates, and disorder weights. A robust two-stage discretization converts continuous disorder into multi-hot atomic assignments, enabling realistic structure generation; and a COD-derived benchmark for SD, PD, and SPD structures supports CSP and DNG tasks. Empirical results show that DMFlow outperforms adapted baselines on SPD cases and maintains strong performance on SD cases, demonstrating the value of a unified, geometry-aware generative framework for disordered materials. This work paves the way for AI-driven discovery of disordered materials with tunable properties.
Abstract
The design of materials with tailored properties is crucial for technological progress. However, most deep generative models focus exclusively on perfectly ordered crystals, neglecting the important class of disordered materials. To address this gap, we introduce DMFlow, a generative framework specifically designed for disordered crystals. Our approach introduces a unified representation for ordered, Substitutionally Disordered (SD), and Positionally Disordered (PD) crystals, and employs a flow matching model to jointly generate all structural components. A key innovation is a Riemannian flow matching framework with spherical reparameterization, which ensures physically valid disorder weights on the probability simplex. The vector field is learned by a novel Graph Neural Network (GNN) that incorporates physical symmetries and a specialized message-passing scheme. Finally, a two-stage discretization procedure converts the continuous weights into multi-hot atomic assignments. To support research in this area, we release a benchmark containing SD, PD, and mixed structures curated from the Crystallography Open Database. Experiments on Crystal Structure Prediction (CSP) and De Novo Generation (DNG) tasks demonstrate that DMFlow significantly outperforms state-of-the-art baselines adapted from ordered crystal generation. We hope our work provides a foundation for the AI-driven discovery of disordered materials.
