AlphaFold Meets Flow Matching for Generating Protein Ensembles
Bowen Jing, Bonnie Berger, Tommi Jaakkola
TL;DR
The paper tackles protein conformational heterogeneity by converting single-state predictors like AlphaFold and ESMFold into flow-matching, sequence-conditioned generators to sample structural ensembles. It introduces AlphaFlow and ESMFlow, which use a harmonic prior and an SE(3) quotient-space formulation with Fréchet-mean supervision to produce diverse yet accurate ensembles. Empirical results show superior precision-diversity tradeoffs on PDB ensembles compared with MSA subsampling, and faithful reproduction of MD-derived distributions and higher-order observables when trained on ATLAS MD ensembles, with faster convergence than explicit MD in some cases. The approach also yields efficient, distillable inference, offering a practical proxy for expensive physics-based simulations and broad applicability to structural biology tasks.
Abstract
The biological functions of proteins often depend on dynamic structural ensembles. In this work, we develop a flow-based generative modeling approach for learning and sampling the conformational landscapes of proteins. We repurpose highly accurate single-state predictors such as AlphaFold and ESMFold and fine-tune them under a custom flow matching framework to obtain sequence-conditoned generative models of protein structure called AlphaFlow and ESMFlow. When trained and evaluated on the PDB, our method provides a superior combination of precision and diversity compared to AlphaFold with MSA subsampling. When further trained on ensembles from all-atom MD, our method accurately captures conformational flexibility, positional distributions, and higher-order ensemble observables for unseen proteins. Moreover, our method can diversify a static PDB structure with faster wall-clock convergence to certain equilibrium properties than replicate MD trajectories, demonstrating its potential as a proxy for expensive physics-based simulations. Code is available at https://github.com/bjing2016/alphaflow.
