Flexible MOF Generation with Torsion-Aware Flow Matching
Nayoung Kim, Seongsu Kim, Sungsoo Ahn
TL;DR
MOFFlow-2 introduces a two-stage framework that jointly enables novel MOF chemistries and accurate 3D structure generation without a fixed block library or rigid conformations. The first stage autoregressively generates building blocks in SMILES form, while the second stage uses a torsion-aware flow model to predict translations, rotations, torsions, and lattice parameters for full 3D assembly. By explicitly modeling torsion angles and employing canonicalization and MOF matching to mitigate symmetry and distributional shifts, MOFFlow-2 achieves higher reconstruction accuracy and generates valid, novel, and diverse MOFs, including blocks unseen during training. The approach advances automated MOF design with practical improvements in both structure prediction and generative design, while acknowledging limitations related to initialization tools and energy evaluation. Overall, MOFFlow-2 represents a significant step toward flexible, scalable MOF discovery and design.
Abstract
Designing metal-organic frameworks (MOFs) with novel chemistries is a longstanding challenge due to their large combinatorial space and complex 3D arrangements of the building blocks. While recent deep generative models have enabled scalable MOF generation, they assume (1) a fixed set of building blocks and (2) known local 3D coordinates of building blocks. However, this limits their ability to (1) design novel MOFs and (2) generate the structure using novel building blocks. We propose a two-stage MOF generation framework that overcomes these limitations by modeling both chemical and geometric degrees of freedom. First, we train an SMILES-based autoregressive model to generate metal and organic building blocks, paired with a cheminformatics toolkit for 3D structure initialization. Second, we introduce a flow matching model that predicts translations, rotations, and torsional angles to assemble the blocks into valid 3D frameworks. Our experiments demonstrate improved reconstruction accuracy, the generation of valid, novel, and unique MOFs, and the ability to create novel building blocks. Our code is available at https://github.com/nayoung10/MOFFlow-2.
