Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction
Ziyi Chen, Yang Yuan, Siming Zheng, Jialong Guo, Sihan Liang, Yangang Wang, Zongguo Wang
TL;DR
This work introduces TransVAE-CSP, a Transformer-Enhanced Variational Autoencoder for crystal structure prediction that jointly reconstructs and generates crystal structures. It advances representation learning by combining adaptive distance expansion with irreducible representations and an encoder built on an equivariant dot-product attention mechanism to capture $E(3)$/SE(3) symmetry. The approach is validated on carbon_24, perov_5, and mp_20, showing superior reconstruction and generation performance relative to several baselines, with dataset-specific RBF choices further enhancing results. The findings demonstrate a robust, distribution-focused CSP framework that enables efficient crystal design and optimization, with future work aimed at closing gaps with diffusion models and enabling composition-conditioned generation.
Abstract
Crystal structure forms the foundation for understanding the physical and chemical properties of materials. Generative models have emerged as a new paradigm in crystal structure prediction(CSP), however, accurately capturing key characteristics of crystal structures, such as periodicity and symmetry, remains a significant challenge. In this paper, we propose a Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction (TransVAE-CSP), who learns the characteristic distribution space of stable materials, enabling both the reconstruction and generation of crystal structures. TransVAE-CSP integrates adaptive distance expansion with irreducible representation to effectively capture the periodicity and symmetry of crystal structures, and the encoder is a transformer network based on an equivariant dot product attention mechanism. Experimental results on the carbon_24, perov_5, and mp_20 datasets demonstrate that TransVAE-CSP outperforms existing methods in structure reconstruction and generation tasks under various modeling metrics, offering a powerful tool for crystal structure design and optimization.
