Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Xiangxin Zhou, Yi Xiao, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma
TL;DR
This paper tackles the challenge of incorporating protein pocket dynamics into structure-based drug design by introducing DynamicFlow, a full-atom, SE(3)-equivariant flow framework trained on MD-derived apo-holo pairs to jointly transform apo pockets into holo conformations and generate binding ligands. The method employs both continuous-flow matching for geometric coordinates and discrete-flow matching for ligand bond types, extended to a stochastic ODE/SDE variant for robustness, and built on a multiscale architecture that combines atom-level EGNNs with residue-level Transformers. Key contributions include a meticulously curated MISATO-based dataset, a full-atom flow model capable of capturing backbone translations, side-chain torsions, and ligand topology, and demonstrations that generated holo-like pockets improve the performance of traditional SBDD methods while yielding promising ligands with favorable pharmacokinetic properties. The approach advances practical drug discovery by providing physically informed holo-pocket inputs and end-to-end generative capabilities that account for protein dynamics and induced-fit effects.
Abstract
The dynamic nature of proteins, influenced by ligand interactions, is essential for comprehending protein function and progressing drug discovery. Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures, limiting their practical application in drug development. While molecular dynamics simulation can theoretically capture all the biologically relevant conformations, the transition rate is dictated by the intrinsic energy barrier between them, making the sampling process computationally expensive. To overcome the aforementioned challenges, we propose to use generative modeling for SBDD considering conformational changes of protein pockets. We curate a dataset of apo and multiple holo states of protein-ligand complexes, simulated by molecular dynamics, and propose a full-atom flow model (and a stochastic version), named DynamicFlow, that learns to transform apo pockets and noisy ligands into holo pockets and corresponding 3D ligand molecules. Our method uncovers promising ligand molecules and corresponding holo conformations of pockets. Additionally, the resultant holo-like states provide superior inputs for traditional SBDD approaches, playing a significant role in practical drug discovery.
