Mofasa: A Step Change in Metal-Organic Framework Generation
Vaidotas Simkus, Anders Christensen, Steven Bennett, Ian Johnson, Mark Neumann, James Gin, Jonathan Godwin, Benjamin Rhodes
TL;DR
Mofasa introduces an all-atom latent diffusion model capable of generating full MOF structures up to 500 atoms, overcoming prior scalability limits of all-atom methods. By jointly sampling atom types, positions, and lattice vectors within a hierarchical GNS backbone and employing specialized training strategies, it achieves state-of-the-art validity and dynamic stability, and shows strong generalization via rediscovery of unseen nodes and topologies. The work also releases MofasaDB to enable large-scale screening and analysis, and argues for the broad applicability of all-atom diffusion as a foundation model for materials, enabling cross-domain transfer and rapid discovery without reliance on rigid modular building-block schemes. The results indicate that an all-atom approach can match or surpass domain-specific MOF generators, with potential impact across porous materials and crystalline systems. Limitations include computational scaling and conditional sampling challenges, pointing to future work on sparsity, broader material classes, and integration with property-guided search.
Abstract
Mofasa is an all-atom latent diffusion model with state-of-the-art performance for generating Metal-Organic Frameworks (MOFs). These are highly porous crystalline materials used to harvest water from desert air, capture carbon dioxide, store toxic gases and catalyse chemical reactions. In recognition of their value, the development of MOFs recently received a Nobel Prize in Chemistry. In many ways, MOFs are well-suited for exploiting generative models in chemistry: they are rationally-designable materials with a large combinatorial design space and strong structure-property couplings. And yet, to date, a high performance generative model has been lacking. To fill this gap, we introduce Mofasa, a general-purpose latent diffusion model that jointly samples positions, atom-types and lattice vectors for systems as large as 500 atoms. Mofasa avoids handcrafted assembly algorithms common in the literature, unlocking the simultaneous discovery of metal nodes, linkers and topologies. To help the scientific community build on our work, we release MofasaDB, an annotated library of hundreds of thousands of sampled MOF structures, along with a user-friendly web interface for search and discovery: https://mofux.ai/ .
