A Periodic Bayesian Flow for Material Generation
Hanlin Wu, Yuxuan Song, Jingjing Gong, Ziyao Cao, Yawen Ouyang, Jianbing Zhang, Hao Zhou, Wei-Ying Ma, Jingjing Liu
TL;DR
The paper tackles crystal generation under periodic symmetry by modeling non-Euclidean variables on the hyper-torus $\mathbb{T}^{3\times N}$. It introduces CrysBFN, a periodic E(3)-equivariant Bayesian flow that uses a von Mises input distribution and an entropy-conditioning mechanism to handle non-additive accuracy, enabling fast, non-autoregressive sampling. The method jointly handles atom types, lattice parameters, and fractional coordinates, achieving state-of-the-art performance on ab initio generation and crystal structure prediction, with orders-of-magnitude improvements in sampling efficiency. This work broadens the applicability of Bayesian flows to non-Euclidean, periodically invariant data and provides a framework that can be extended to other hyper-torus domains.
Abstract
Generative modeling of crystal data distribution is an important yet challenging task due to the unique periodic physical symmetry of crystals. Diffusion-based methods have shown early promise in modeling crystal distribution. More recently, Bayesian Flow Networks were introduced to aggregate noisy latent variables, resulting in a variance-reduced parameter space that has been shown to be advantageous for modeling Euclidean data distributions with structural constraints (Song et al., 2023). Inspired by this, we seek to unlock its potential for modeling variables located in non-Euclidean manifolds e.g. those within crystal structures, by overcoming challenging theoretical issues. We introduce CrysBFN, a novel crystal generation method by proposing a periodic Bayesian flow, which essentially differs from the original Gaussian-based BFN by exhibiting non-monotonic entropy dynamics. To successfully realize the concept of periodic Bayesian flow, CrysBFN integrates a new entropy conditioning mechanism and empirically demonstrates its significance compared to time-conditioning. Extensive experiments over both crystal ab initio generation and crystal structure prediction tasks demonstrate the superiority of CrysBFN, which consistently achieves new state-of-the-art on all benchmarks. Surprisingly, we found that CrysBFN enjoys a significant improvement in sampling efficiency, e.g., ~100x speedup 10 v.s. 2000 steps network forwards) compared with previous diffusion-based methods on MP-20 dataset. Code is available at https://github.com/wu-han-lin/CrysBFN.
