Table of Contents
Fetching ...

Gradient Propagation in Retrosynthetic Space: An Efficient Framework for Synthesis Plan Generation

Chengyang Tian, Yuhang Chang, Yangpeng Zhang, Yang Liu

TL;DR

This work addresses retrosynthetic planning under uncertain chemical space by reframing the problem as an AND-OR graph search and introducing a gradient-propagation framework to maximize the Successful Synthesis Probability ($SSP$) across multiple routes. It combines differentiable SSP estimation with bottom-up $s$-value calculations and top-down gradient propagation to drive a greedy, influence-based node expansion, achieving efficient search and improved SSP. Key contributions include a bottom-up \\textit{s-value} update, a top-down gradient mechanism for node selection, and extensive ablations demonstrating efficiency gains over state-of-the-art baselines on large-scale benchmarks. The approach offers broad applicability to multi-route generation in synthesis planning and provides a foundation for integrating uncertainty modeling with efficient, gradient-guided search.

Abstract

Retrosynthesis, which aims to identify viable synthetic pathways for target molecules by decomposing them into simpler precursors, is often treated as a search problem. However, its complexity arises from multi-branched tree-structured pathways rather than linear paths. Some algorithms have been successfully applied in this task, but they either overlook the uncertainties inherent in chemical space or face limitations in practical application scenarios. To address these challenges, this paper introduces a novel gradient-propagation-based algorithmic framework for retrosynthetic route exploration. The proposed framework obtains the contributions of different nodes to the target molecule's success probability through gradient propagation and then guides the algorithm to greedily select the node with the highest contribution for expansion, thereby conducting efficient search in the chemical space. Experimental validations demonstrate that our algorithm achieves broad applicability across diverse molecular targets and exhibits superior computational efficiency compared to existing methods.

Gradient Propagation in Retrosynthetic Space: An Efficient Framework for Synthesis Plan Generation

TL;DR

This work addresses retrosynthetic planning under uncertain chemical space by reframing the problem as an AND-OR graph search and introducing a gradient-propagation framework to maximize the Successful Synthesis Probability () across multiple routes. It combines differentiable SSP estimation with bottom-up -value calculations and top-down gradient propagation to drive a greedy, influence-based node expansion, achieving efficient search and improved SSP. Key contributions include a bottom-up \\textit{s-value} update, a top-down gradient mechanism for node selection, and extensive ablations demonstrating efficiency gains over state-of-the-art baselines on large-scale benchmarks. The approach offers broad applicability to multi-route generation in synthesis planning and provides a foundation for integrating uncertainty modeling with efficient, gradient-guided search.

Abstract

Retrosynthesis, which aims to identify viable synthetic pathways for target molecules by decomposing them into simpler precursors, is often treated as a search problem. However, its complexity arises from multi-branched tree-structured pathways rather than linear paths. Some algorithms have been successfully applied in this task, but they either overlook the uncertainties inherent in chemical space or face limitations in practical application scenarios. To address these challenges, this paper introduces a novel gradient-propagation-based algorithmic framework for retrosynthetic route exploration. The proposed framework obtains the contributions of different nodes to the target molecule's success probability through gradient propagation and then guides the algorithm to greedily select the node with the highest contribution for expansion, thereby conducting efficient search in the chemical space. Experimental validations demonstrate that our algorithm achieves broad applicability across diverse molecular targets and exhibits superior computational efficiency compared to existing methods.
Paper Structure (30 sections, 16 equations, 11 figures, 3 tables, 1 algorithm)

This paper contains 30 sections, 16 equations, 11 figures, 3 tables, 1 algorithm.

Figures (11)

  • Figure 1: The expansion process of an AND-OR tree. Blue nodes represents purchasable molecules, while gray nodes are other unpurchasable leaf nodes. Green nodes are successful nodes within a route. Reaction $R_3$ is NOT viable because it is an AND node with one unsuccessful node $M_5$.
  • Figure 2: Circles represent molecule nodes, and rectangles represent reaction nodes. The numbers denotes the calculation order of each node. (a) In a tree structure, the bottom-up order is equivalent to the reversed topological order. (b) In acyclic graphs, the bottom-up calculation order may differ from the reversed topological order. In the bottom-up update, the top node is only enqueued once. (c) In a cyclic graph, loops are avoided by not to enqueue the node "1" twice.
  • Figure 3: Average SSP achieved by algorithms on the 190 "hard" molecules under different default values $s_0$. Error bars represent standard errors. Detail settings are in Appendix \ref{['a_setup']}.
  • Figure 4: The structures of OR tree, AND-OR tree. In an OR tree, each node represents a set of molecules. In an AND-OR tree, there are two types of nodes. Each OR node (circle) represents a single molecule, while each AND node (rectangle) represent a chemical reaction.
  • Figure 5: Accurate probability calculating process under two cyclic graphs. Green arrows represent the calculating order.
  • ...and 6 more figures