Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

Siyi Gu; Minkai Xu; Alexander Powers; Weili Nie; Tomas Geffner; Karsten Kreis; Jure Leskovec; Arash Vahdat; Stefano Ermon

Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

Siyi Gu, Minkai Xu, Alexander Powers, Weili Nie, Tomas Geffner, Karsten Kreis, Jure Leskovec, Arash Vahdat, Stefano Ermon

TL;DR

This paper proposes a novel and general alignment framework to align pretrained target diffusion models with preferred functional properties, named AliDiff, and develops an improved Exact Energy Preference Optimization method to yield an exact and efficient alignment of the diffusion models.

Abstract

Generating ligand molecules for specific protein targets, known as structure-based drug design, is a fundamental problem in therapeutics development and biological discovery. Recently, target-aware generative models, especially diffusion models, have shown great promise in modeling protein-ligand interactions and generating candidate drugs. However, existing models primarily focus on learning the chemical distribution of all drug candidates, which lacks effective steerability on the chemical quality of model generations. In this paper, we propose a novel and general alignment framework to align pretrained target diffusion models with preferred functional properties, named AliDiff. AliDiff shifts the target-conditioned chemical distribution towards regions with higher binding affinity and structural rationality, specified by user-defined reward functions, via the preference optimization approach. To avoid the overfitting problem in common preference optimization objectives, we further develop an improved Exact Energy Preference Optimization method to yield an exact and efficient alignment of the diffusion models, and provide the closed-form expression for the converged distribution. Empirical studies on the CrossDocked2020 benchmark show that AliDiff can generate molecules with state-of-the-art binding energies with up to -7.07 Avg. Vina Score, while maintaining strong molecular properties. Code is available at https://github.com/MinkaiXu/AliDiff.

Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

TL;DR

Abstract

Paper Structure (17 sections, 4 theorems, 18 equations, 7 figures, 6 tables, 2 algorithms)

This paper contains 17 sections, 4 theorems, 18 equations, 7 figures, 6 tables, 2 algorithms.

Introduction
Related Work
Method
Overview
Energy Preference Optimization
Exact Energy Optimization
Experiment
Experiment Setup
Results
Ablation Studies
Conclusion
Limitations and Future Work
Algorithm
Proof
Implementation Details
...and 2 more sections

Key Result

Theorem 3.1

The objective function in eq:loss-alidiff-exact optimizes a variational upper bound of the KL-divergence $\mathbb{D}_\textnormal{KL}(\hat{p}^*({\mathbf{m}}|{\mathbf{p}})||\hat{p}_\theta({\mathbf{m}}|{\mathbf{p}}))$, where $\hat{p}^*({\mathbf{m}}|{\mathbf{p}}) \propto p_\textnormal{ref}({\mathbf{m}}|

Figures (7)

Figure 1: High-level illustration of AliDiff. For a protein target, we can have multiple candidate ligands and rank the preference by certain reward functions, e.g., binding energy. We align the target-aware molecule diffusion model with these preferences by adjusting the conditional likelihoods.
Figure 2: Overview of AliDiff. This workflow can be summarized as 1) For each protein target (pocket) ${\mathbf{p}}$ in the training set, we retrieve two candidate ligands ${\mathbf{m}}$; 2) Label the two ligands as wining sample ${\mathbf{m}}^w$ and losing sample ${\mathbf{m}}^l$ by desirable properties, e.g., binding energies; 3) Calculate the preference optimization objective \ref{['eq:loss-alidiff-exact']} and update the molecule diffusion model $p_\theta$.
Figure 3: Median Vina energy for different generated molecules (TargetDiff, IPDiff, AliDiff) across 100 testing samples, sorted by the median Vina energy of molecules generated from AliDiff.
Figure 4: Visualizations of reference molecules and generated ligands for protein pockets (1l3l, 2e24) generated by TargetDiff, IPDiff, and AliDiff. Vina score, QED, and SA are reported below.
Figure 5: Ablation analysis of AliDiff under different $\beta$. Vina Score, High Affinity, QED, and diversity are reported, where blue lines represent AliDiff-DPO, and orange lines represent AliDiff. The dotted lines represent the baseline IPDiff.
...and 2 more figures

Theorems & Definitions (5)

Theorem 3.1
Theorem \ref{theorem-kl}
Lemma C.1
Lemma C.2
proof

Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

TL;DR

Abstract

Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (5)