Table of Contents
Fetching ...

Structure-based Drug Design Benchmark: Do 3D Methods Really Dominate?

Kangyu Zheng, Yingzhou Lu, Zaixi Zhang, Zhongwei Wan, Yao Ma, Marinka Zitnik, Tianfan Fu

TL;DR

This paper addresses the need for a cross-algorithm benchmark in structure-based drug design by evaluating 16 models spanning 1D, 2D, and 3D design paradigms on both molecular properties and docking affinities to seven target proteins. It demonstrates that 1D/2D ligand-centric approaches, when treated with docking as a black-box oracle, can match or exceed 3D methods, with AutoGrow4 (a 2D genetic algorithm) achieving the strongest overall optimization performance. Across the results, no single method dominates all metrics (docking, drug-likeness, synthesizability, and generative quality); nonetheless, genetic algorithms, particularly AutoGrow4, show robust docking performance and favorable synthetic accessibility. The findings advocate for hybrid SBDD designs that integrate GA with other computational strategies to optimize both binding affinity and molecular properties, highlighting practical implications for accelerating drug discovery and informing future benchmark design.

Abstract

Currently, the field of structure-based drug design is dominated by three main types of algorithms: search-based algorithms, deep generative models, and reinforcement learning. While existing works have typically focused on comparing models within a single algorithmic category, cross-algorithm comparisons remain scarce. In this paper, to fill the gap, we establish a benchmark to evaluate the performance of sixteen models across these different algorithmic foundations by assessing the pharmaceutical properties of the generated molecules and their docking affinities with specified target proteins. We highlight the unique advantages of each algorithmic approach and offer recommendations for the design of future SBDD models. We emphasize that 1D/2D ligand-centric drug design methods can be used in SBDD by treating the docking function as a black-box oracle, which is typically neglected. The empirical results show that 1D/2D methods achieve competitive performance compared with 3D-based methods that use the 3D structure of the target protein explicitly. Also, AutoGrow4, a 2D molecular graph-based genetic algorithm, dominates SBDD in terms of optimization ability. The relevant code is available in https://github.com/zkysfls/2024-sbdd-benchmark.

Structure-based Drug Design Benchmark: Do 3D Methods Really Dominate?

TL;DR

This paper addresses the need for a cross-algorithm benchmark in structure-based drug design by evaluating 16 models spanning 1D, 2D, and 3D design paradigms on both molecular properties and docking affinities to seven target proteins. It demonstrates that 1D/2D ligand-centric approaches, when treated with docking as a black-box oracle, can match or exceed 3D methods, with AutoGrow4 (a 2D genetic algorithm) achieving the strongest overall optimization performance. Across the results, no single method dominates all metrics (docking, drug-likeness, synthesizability, and generative quality); nonetheless, genetic algorithms, particularly AutoGrow4, show robust docking performance and favorable synthetic accessibility. The findings advocate for hybrid SBDD designs that integrate GA with other computational strategies to optimize both binding affinity and molecular properties, highlighting practical implications for accelerating drug discovery and informing future benchmark design.

Abstract

Currently, the field of structure-based drug design is dominated by three main types of algorithms: search-based algorithms, deep generative models, and reinforcement learning. While existing works have typically focused on comparing models within a single algorithmic category, cross-algorithm comparisons remain scarce. In this paper, to fill the gap, we establish a benchmark to evaluate the performance of sixteen models across these different algorithmic foundations by assessing the pharmaceutical properties of the generated molecules and their docking affinities with specified target proteins. We highlight the unique advantages of each algorithmic approach and offer recommendations for the design of future SBDD models. We emphasize that 1D/2D ligand-centric drug design methods can be used in SBDD by treating the docking function as a black-box oracle, which is typically neglected. The empirical results show that 1D/2D methods achieve competitive performance compared with 3D-based methods that use the 3D structure of the target protein explicitly. Also, AutoGrow4, a 2D molecular graph-based genetic algorithm, dominates SBDD in terms of optimization ability. The relevant code is available in https://github.com/zkysfls/2024-sbdd-benchmark.
Paper Structure (22 sections, 5 figures, 22 tables)

This paper contains 22 sections, 5 figures, 22 tables.

Figures (5)

  • Figure 1: The bar chart of average generated molecules that are calculated by our selected oracles for each model across all target proteins under given time. 1D methods are colored red, blue is used to indicate 2D methods, and green represents 3D methods.
  • Figure 2: The heatmap based on the average of each model's Top-10 docking score for each target protein.
  • Figure 3: The bar chart is based on the average of each model's Top-10 docking score. On the left, 1D methods are colored red, blue is used to indicate 2D methods, and green represents 3D methods. On the right, generative models are in blue, search-based methods are in green, and reinforcement learning methods are in red.
  • Figure 4: The radar chart is based on the ranking of each model's average performance. The outermost circle represents the best ranking for each metric and vice versa. No methods can dominate structure-based drug design in all the evaluation metrics.
  • Figure 5: Examples of best generated molecules for each PDB