Harmonic Self-Conditioned Flow Matching for Multi-Ligand Docking and Binding Site Design
Hannes Stärk, Bowen Jing, Regina Barzilay, Tommi Jaakkola
TL;DR
This work introduces HarmonicFlow, a harmonic-prior, self-conditioned flow-matching approach for 3D docking of multi-ligand complexes, and FlowSite, a joint discrete-continuous generative model that designs protein binding pockets by simultaneously predicting residue identities and ligand poses. HarmonicFlow outperforms state-of-the-art diffusion-based docking methods in pocket-level tasks and provides a robust foundation for FlowSite, which augments the docking process with residue-type generation and a powerful training regime including fake-ligand augmentation and multiple loss terms. FlowSite achieves substantially better binding-site recovery than baselines and approaches an oracle level without access to ground-truth ligand structures, demonstrating the feasibility of automated, generalizable binding-pocket design for single- and multi-ligand scenarios. The framework leverages SE(3)-equivariant refinement TFNs and invariant graph-attention layers to jointly model discrete and continuous data, offering a scalable path toward practical applications in drug design, enzyme engineering, and biomolecular design. The work advances both the theoretical and practical capabilities of generative biomolecular design by unifying structure generation and pocket design under a single, self-conditioned flow framework.
Abstract
A significant amount of protein function requires binding small molecules, including enzymatic catalysis. As such, designing binding pockets for small molecules has several impactful applications ranging from drug synthesis to energy storage. Towards this goal, we first develop HarmonicFlow, an improved generative process over 3D protein-ligand binding structures based on our self-conditioned flow matching objective. FlowSite extends this flow model to jointly generate a protein pocket's discrete residue types and the molecule's binding 3D structure. We show that HarmonicFlow improves upon state-of-the-art generative processes for docking in simplicity, generality, and average sample quality in pocket-level docking. Enabled by this structure modeling, FlowSite designs binding sites substantially better than baseline approaches.
