MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks
Nayoung Kim, Seongsu Kim, Minsu Kim, Jinkyoo Park, Sungsoo Ahn
TL;DR
MOFFlow introduces a building-block based, SE(3)-invariant deep generative model for MOF structure prediction using Riemannian flow matching to predict block roto-translations and lattice parameters. By representing MOFs as rigid blocks and operating in $SE(3)$, it dramatically reduces the search space and scales to unit cells with hundreds to thousands of atoms, outperforming traditional CSP and diffusion-based baselines in both accuracy and speed. The framework enforces crystal symmetries, employs a hierarchical MOF-specific architecture with MOFAttention, and demonstrates superior performance on a large MOF dataset, including accurate property reproduction. These results suggest MOFFlow's potential to accelerate MOF discovery and design by enabling scalable, accurate structure generation and property prediction.
Abstract
Metal-organic frameworks (MOFs) are a class of crystalline materials with promising applications in many areas such as carbon capture and drug delivery. In this work, we introduce MOFFlow, the first deep generative model tailored for MOF structure prediction. Existing approaches, including ab initio calculations and even deep generative models, struggle with the complexity of MOF structures due to the large number of atoms in the unit cells. To address this limitation, we propose a novel Riemannian flow matching framework that reduces the dimensionality of the problem by treating the metal nodes and organic linkers as rigid bodies, capitalizing on the inherent modularity of MOFs. By operating in the $SE(3)$ space, MOFFlow effectively captures the roto-translational dynamics of these rigid components in a scalable way. Our experiment demonstrates that MOFFlow accurately predicts MOF structures containing several hundred atoms, significantly outperforming conventional methods and state-of-the-art machine learning baselines while being much faster.
