Generative modeling of protein ensembles guided by crystallographic electron densities
Sai Advaith Maddipatla, Nadav Bojan Sellam, Sanketh Vedula, Ailie Marx, Alex Bronstein
TL;DR
This work tackles the challenge of reconstructing dynamic protein ensembles from crystallographic electron density by formulating an inverse problem that aligns multiple conformations with observed density. It introduces a density-guided sampling framework that uses a pre-trained diffusion model (Chroma) as a prior and a differentiable forward model of electron density, employing non-i.i.d. score guidance across the entire ensemble and matching-pursuit filtering to prevent overfitting to noise. The authors demonstrate that their approach recovers multi-modal altloc conformations and yields improved density alignment compared with unconditional sampling, particularly in regions with bimodal density. This methodology enables more accurate, data-driven modeling of protein dynamics and opens avenues for applying similar strategies to other experimental modalities and larger systems.
Abstract
Proteins are dynamic, adopting ensembles of conformations. The nature of this conformational heterogenity is imprinted in the raw electron density measurements obtained from X-ray crystallography experiments. Fitting an ensemble of protein structures to these measurements is a challenging, ill-posed inverse problem. We propose a non-i.i.d. ensemble guidance approach to solve this problem using existing protein structure generative models and demonstrate that it accurately recovers complicated multi-modal alternate protein backbone conformations observed in certain single crystal measurements.
