Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis
Xinyi Li, Sai Wang, Yutian Lin, Yu Wu, Yi Yang
TL;DR
Retro-Expert addresses interpretability gaps in retrosynthesis by integrating specialized models to build a chemical decision space and an LLM to perform knowledge-grounded reasoning, optimized via KGPO. The framework yields not only reactant predictions but also interpretable natural-language reasoning paths, bridging AI outputs with chemical principles. Empirical results on USPTO-50K/USPTO-FULL show superior accuracy over both LLM-based and non-LLM baselines and strong generalization, including an OOD ChemBench improvement. Wet-lab validation and PaRoutes experiments demonstrate practical utility and scalable reasoning.
Abstract
Retrosynthesis prediction aims to infer the reactant molecule based on a given product molecule, which is a fundamental task in chemical synthesis. However, existing models rely on static pattern-matching paradigm, which limits their ability to perform effective logic decision-making, leading to black-box decision-making. Building on this, we propose Retro-Expert, an interpretable retrosynthesis framework that performs collaborative reasoning by combining the complementary reasoning strengths of Large Language Models and specialized models via reinforcement learning. It outputs natural language explanations grounded in chemical logic through three components: (1) specialized models analyze the product to construct high-quality chemical decision space, (2) LLM-driven critical reasoning to generate predictions and corresponding interpretable reasoning path, and (3) reinforcement learning optimizing interpretable decision policy. Experiments show that Retro-Expert not only surpasses both LLM-based and specialized models across different metrics but also provides expert-aligned explanations that bridge the gap between AI predictions and actionable chemical insights.
