Siamese Foundation Models for Crystal Structure Prediction
Liming Wu, Wenbing Huang, Rui Jiao, Jianxing Huang, Liwei Liu, Yipeng Zhou, Hao Sun, Yang Liu, Fuchun Sun, Yuxiang Ren, Jirong Wen
TL;DR
This work introduces DAO, a Siamese foundation-model framework for Crystal Structure Prediction, pairing a diffusion-based structure generator (DAO-G) with an energy predictor (DAO-P) built on the Crysformer graph Transformer to enforce $O(3)$ and periodic invariance. A large crystal pretraining dataset, CrysDB (~940k entries), enables two-stage pretraining for DAO-G with dataset relaxation guided by DAO-P and energy-guided sampling to promote stability. DAO-P provides both diffusion-compatible energy predictions and broad property-prediction capability across eight datasets, achieving state-of-the-art results and enabling accurate $T_c$ estimates for superconductors when augmented with generated structures. Empirically, DAO-G achieves SOTA CSP performance on MP-20 and MPTS-52, demonstrates robust polymorph generation, and, together with DAO-P, attains strong performance on superconductors and diverse material-property tasks, suggesting a scalable pathway for co-designing generative and predictive models in materials science.
Abstract
Crystal Structure Prediction (CSP), which aims to generate stable crystal structures from compositions, represents a critical pathway for discovering novel materials. While structure prediction tasks in other domains, such as proteins, have seen remarkable progress, CSP remains a relatively underexplored area due to the more complex geometries inherent in crystal structures. In this paper, we propose Siamese foundation models specifically designed to address CSP. Our pretrain-finetune framework, named DAO, comprises two complementary foundation models: DAO-G for structure generation and DAO-P for energy prediction. Experiments on CSP benchmarks (MP-20 and MPTS-52) demonstrate that our DAO-G significantly surpasses state-of-the-art (SOTA) methods across all metrics. Extensive ablation studies further confirm that DAO-G excels in generating diverse polymorphic structures, and the dataset relaxation and energy guidance provided by DAO-P are essential for enhancing DAO-G's performance. When applied to three real-world superconductors ($\text{CsV}_3\text{Sb}_5$, $ \text{Zr}_{16}\text{Rh}_8\text{O}_4$ and $\text{Zr}_{16}\text{Pd}_8\text{O}_4$) that are known to be challenging to analyze, our foundation models achieve accurate critical temperature predictions and structure generations. For instance, on $\text{CsV}_3\text{Sb}_5$, DAO-G generates a structure close to the experimental one with an RMSE of 0.0085; DAO-P predicts the $T_c$ value with high accuracy (2.26 K vs. the ground-truth value of 2.30 K). In contrast, conventional DFT calculators like Quantum Espresso only successfully derive the structure of the first superconductor within an acceptable time, while the RMSE is nearly 8 times larger, and the computation speed is more than 1000 times slower. These compelling results collectively highlight the potential of our approach for advancing materials science research and development.
