PILOT: Equivariant diffusion for pocket conditioned de novo ligand generation with multi-objective guidance via importance sampling
Julian Cremer, Tuan Le, Frank Noé, Djork-Arné Clevert, Kristof T. Schütt
TL;DR
PILOT addresses de novo ligand generation conditioned on protein pockets while optimizing multiple properties relevant to drug design. It combines an equivariant diffusion backbone with pocket conditioning and trajectory-based importance sampling guided by property surrogates, enabling backpropagation-free multi-objective control. Pre-training on a large conformer dataset and fine-tuning on CrossDocked2020 yield superior structural validity and drug-likeness, and Kinodata-3D experiments show enhanced potency predictions with maintained docking performance. The approach demonstrates practical potential for structure-based drug design, while highlighting data quality and distribution challenges and pointing toward integration with synthesis pipelines and expansion to biologics.
Abstract
The generation of ligands that both are tailored to a given protein pocket and exhibit a range of desired chemical properties is a major challenge in structure-based drug design. Here, we propose an in-silico approach for the $\textit{de novo}$ generation of 3D ligand structures using the equivariant diffusion model PILOT, combining pocket conditioning with a large-scale pre-training and property guidance. Its multi-objective trajectory-based importance sampling strategy is designed to direct the model towards molecules that not only exhibit desired characteristics such as increased binding affinity for a given protein pocket but also maintains high synthetic accessibility. This ensures the practicality of sampled molecules, thus maximizing their potential for the drug discovery pipeline. PILOT significantly outperforms existing methods across various metrics on the common benchmark dataset CrossDocked2020. Moreover, we employ PILOT to generate novel ligands for unseen protein pockets from the Kinodata-3D dataset, which encompasses a substantial portion of the human kinome. The generated structures exhibit predicted $IC_{50}$ values indicative of potent biological activity, which highlights the potential of PILOT as a powerful tool for structure-based drug design.
