Retro-fallback: retrosynthetic planning in an uncertain world
Austin Tripp, Krzysztof Maziarz, Sarah Lewis, Marwin Segler, José Miguel Hernández-Lobato
TL;DR
The paper tackles retrosynthesis under uncertainty by framing reaction feasibility and buyability as binary stochastic processes and introducing the SSP metric to quantify the probability that at least one plan works. It then presents retro-fallback, a greedy algorithm that explicitly maximizes SSP by leveraging sampled realizations and a recursive, DP-enabled estimation of success likelihood. Across USPTO, GuacaMol, and FusionRetro benchmarks, retro-fallback outperforms baseline methods in SSP, illustrating the value of planning with backup options under uncertainty. The work highlights practical trade-offs, including slower runtime and the need for accurate feasibility models, and outlines directions for improving realism and scalability in lab-ready retrosynthetic planning.
Abstract
Retrosynthesis is the task of planning a series of chemical reactions to create a desired molecule from simpler, buyable molecules. While previous works have proposed algorithms to find optimal solutions for a range of metrics (e.g. shortest, lowest-cost), these works generally overlook the fact that we have imperfect knowledge of the space of possible reactions, meaning plans created by algorithms may not work in a laboratory. In this paper we propose a novel formulation of retrosynthesis in terms of stochastic processes to account for this uncertainty. We then propose a novel greedy algorithm called retro-fallback which maximizes the probability that at least one synthesis plan can be executed in the lab. Using in-silico benchmarks we demonstrate that retro-fallback generally produces better sets of synthesis plans than the popular MCTS and retro* algorithms.
