CrystalFormer-CSP: Thinking Fast and Slow for Crystal Structure Prediction
Zhendong Cao, Shigang Ou, Lei Wang
TL;DR
CrystalFormer-CSP tackles the CSP problem by unifying space-group-aware generative modeling with energy-based relaxation via a universal MLFF, augmented by reinforcement learning. The method generates candidate crystal structures conditioned on a chemical formula, relaxes them with MLFFs, and ranks by energy above hull to assess stability, achieving competitive benchmarks. The approach demonstrates strong space-group prediction and improved CSP match rates on two datasets, with RL further boosting performance and enabling discovery of lower-energy polymorphs. It provides practical usage via Colab and LLM integration, and offers a promising framework for scalable, end-to-end crystal structure prediction.
Abstract
Crystal structure prediction is a fundamental problem in materials science. We present CrystalFormer-CSP, an efficient framework that unifies data-driven heuristic and physics-driven optimization approaches to predict stable crystal structures for given chemical compositions. The approach combines pretrained generative models for space-group-informed structure generation and a universal machine learning force field for energy minimization. Reinforcement fine-tuning can be employed to further boost the accuracy of the framework. We demonstrate the effectiveness of CrystalFormer-CSP on benchmark problems and showcase its usage via web interface and language model integration.
