A User-Tunable Machine Learning Framework for Step-Wise Synthesis Planning
Shivesh Prakash, Nandan Patel, Hans-Arno Jacobsen, Viki Kumar Prasad
TL;DR
MHNpath presents a user-tunable, ML-driven retrosynthetic framework that combines Modern Hopfield Network–based template prioritization with a global greedy tree-search strategy to generate multi-step, cost-aware, and environmentally friendly synthesis routes. The approach supports a multi-objective scoring system prioritizing precursor cost, reaction temperature, and solvent toxicity, enabling greener and more practical routes. Evaluations on PaRoutes and ChemByDesign show strong template-prioritization performance, high pathway replication of gold-standard routes, and the discovery of shorter, cheaper pathways (e.g., a three-step route for dronabinol at $0.12/g) through enzymatic-synthetic hybrids. Supplementary analyses provide detailed hyperparameter tuning, scaffold diversity, pathway visualizations, and case studies that demonstrate MHNpath’s capacity to replicate, improve, and extend established synthetic strategies.
Abstract
We introduce MHNpath, a machine learning-driven retrosynthetic tool designed for computer-aided synthesis planning. Leveraging modern Hopfield networks and novel comparative metrics, MHNpath efficiently prioritizes reaction templates, improving the scalability and accuracy of retrosynthetic predictions. The tool incorporates a tunable scoring system that allows users to prioritize pathways based on cost, reaction temperature, and toxicity, thereby facilitating the design of greener and cost-effective reaction routes. We demonstrate its effectiveness through case studies involving complex molecules from ChemByDesign, showcasing its ability to predict novel synthetic and enzymatic pathways. Furthermore, we benchmark MHNpath against existing frameworks using the PaRoutes dataset, achieving a solution rate of 85.4% and replicating 69.2% of experimentally validated "gold-standard" pathways. Our case studies reveal that the tool can generate shorter, cheaper, moderate-temperature routes employing green solvents, as exemplified by compounds such as dronabinol, arformoterol, and lupinine.
