CryoFM: A Flow-based Foundation Model for Cryo-EM Densities
Yi Zhou, Yilai Li, Jing Yuan, Quanquan Gu
TL;DR
CryoFM presents a flow-matching foundation model that learns the prior distribution of high-quality cryo-EM density maps and enables posterior sampling conditioned on observed data without task-specific fine-tuning. It deploys two architectures, CryoFM-S for local detail and CryoFM-L for global structure, built on a 3D HDiT transformer and trained on EMDB maps to model $p_0({\mathbf{x}}_0)$. By deriving a flow posterior sampling method, CryoFM can tackle downstream tasks such as spectral and anisotropic noise denoising, missing wedge restoration, and ab initio modeling within a plug-and-play framework, achieving state-of-the-art performance on most tasks. The work demonstrates the potential of flow-based foundation models to unify and accelerate cryo-EM density processing, while noting challenges in real-world noisy densities and reconstruction from raw 2D particles for future research.
Abstract
Cryo-electron microscopy (cryo-EM) is a powerful technique in structural biology and drug discovery, enabling the study of biomolecules at high resolution. Significant advancements by structural biologists using cryo-EM have led to the production of over 38,626 protein density maps at various resolutions1. However, cryo-EM data processing algorithms have yet to fully benefit from our knowledge of biomolecular density maps, with only a few recent models being data-driven but limited to specific tasks. In this study, we present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps and generalizing effectively to downstream tasks. Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps. Furthermore, we introduce a flow posterior sampling method that leverages CRYOFM as a flexible prior for several downstream tasks in cryo-EM and cryo-electron tomography (cryo-ET) without the need for fine-tuning, achieving state-of-the-art performance on most tasks and demonstrating its potential as a foundational model for broader applications in these fields.
