Synthelite: Chemist-aligned and feasibility-aware synthesis planning with LLMs
Nguyen Xuan-Vu, Daniel Armstrong, Milena Wehrbach, Andres M Bran, Zlatko Jončev, Philippe Schwaller
TL;DR
CASP faces challenges in integrating chemist feedback and assessing route feasibility. Synthelite answers with a two-phase framework where LLMs draft retrosynthetic blueprints (Phase 1) and a similarity-based Monte Carlo Tree Search realizes routes aligned with stock constraints (Phase 2). It demonstrates high steerability to expert prompts, effective handling of starting-material constraints, and feasibility-aware route design, achieving competitive solve rates on USPTO benchmarks. The work highlights a practical path toward LLM-centric orchestration of synthesis planning while acknowledging current limitations of closed LLMs and template-matching gaps. Overall, Synthelite represents a significant step toward interactive, chemistry-grounded, LLM-guided CASP tooling with real-world potential.
Abstract
Computer-aided synthesis planning (CASP) has long been envisioned as a complementary tool for synthetic chemists. However, existing frameworks often lack mechanisms to allow interaction with human experts, limiting their ability to integrate chemists' insights. In this work, we introduce Synthelite, a synthesis planning framework that uses large language models (LLMs) to directly propose retrosynthetic transformations. Synthelite can generate end-to-end synthesis routes by harnessing the intrinsic chemical knowledge and reasoning capabilities of LLMs, while allowing expert intervention through natural language prompts. Our experiments demonstrate that Synthelite can flexibly adapt its planning trajectory to diverse user-specified constraints, achieving up to 95\% success rates in both strategy-constrained and starting-material-constrained synthesis tasks. Additionally, Synthelite exhibits the ability to account for chemical feasibility during route design. We envision Synthelite to be both a useful tool and a step toward a paradigm where LLMs are the central orchestrators of synthesis planning.
