Generative models for crystalline materials
Houssam Metni, Laura Ruple, Lauren N. Walters, Luca Torresi, Jonas Teufel, Henrik Schopmans, Jona Östreicher, Yumeng Zhang, Marlen Neubert, Yuri Koide, Kevin Steiner, Paul Link, Lukas Bär, Mariana Petrova, Gerbrand Ceder, Pascal Friederich
TL;DR
This review surveys the emergence of end-to-end generative modeling for crystalline materials, detailing crystal representations, data resources, and a spectrum of generative approaches from VAEs and GANs to diffusion and GFlowNets. It contrasts traditional CSP-and-screening pipelines with end-to-end methods that aim to directly propose stable, synthesizable crystal structures while respecting crystallographic symmetry. The article also covers practical considerations, including conditioning, software availability, computational cost, evaluation metrics, and post-generation synthesis workflows, and discusses emerging topics like defects, disorder, and synthesis-aware design. Collectively, it highlights the progress and remaining challenges in translating ML-generated crystal structures into experimentally realizable materials, and outlines directions for faster, symmetry-informed, and synthesis-conscious generation models with broad impact on materials discovery. The work serves as a guide for experimentalists and ML researchers to navigate representations, benchmarks, and practical deployment in inverse materials design.
Abstract
Understanding structure-property relationships in materials is fundamental in condensed matter physics and materials science. Over the past few years, machine learning (ML) has emerged as a powerful tool for advancing this understanding and accelerating materials discovery. Early ML approaches primarily focused on constructing and screening large material spaces to identify promising candidates for various applications. More recently, research efforts have increasingly shifted toward generating crystal structures using end-to-end generative models. This review analyzes the current state of generative modeling for crystal structure prediction and \textit{de novo} generation. It examines crystal representations, outlines the generative models used to design crystal structures, and evaluates their respective strengths and limitations. Furthermore, the review highlights experimental considerations for evaluating generated structures and provides recommendations for suitable existing software tools. Emerging topics, such as modeling disorder and defects, integration in advanced characterization, and incorporating synthetic feasibility constraints, are explored. Ultimately, this work aims to inform both experimental scientists looking to adapt suitable ML models to their specific circumstances and ML specialists seeking to understand the unique challenges related to inverse materials design and discovery.
