SynthLens: Visual Analytics for Facilitating Multi-step Synthetic Route Design
Qipeng Wang, Rui Sheng, Shaolun Ruan, Xiaofu Jin, Chuhan Shi, Min Zhu
TL;DR
SynthLens presents a visual analytics system for multi-step synthetic route design by integrating literature-derived reactions into a tree-structured workflow. It combines a tree-form route overview, multiple views for paper exploration, molecule similarity, and weighted ranking to support sequential decision-making with both quantitative and qualitative criteria. The approach employs PubMedBERT embeddings for paper clustering and an Eunomia chemist AI agent for extracting reaction details, with a Context Relevancy metric to gauge extraction quality. Case studies and expert interviews demonstrate improved workflow efficiency and decision quality, suggesting broad applicability to retrosynthesis and other multi-criteria decision tasks in chemistry.
Abstract
Designing synthetic routes for novel molecules is pivotal in various fields like medicine and chemistry. In this process, researchers need to explore a set of synthetic reactions to transform starting molecules into intermediates step by step until the target novel molecule is obtained. However, designing synthetic routes presents challenges for researchers. First, researchers need to make decisions among numerous possible synthetic reactions at each step, considering various criteria (e.g., yield, experimental duration, and the count of experimental steps) to construct the synthetic route. Second, they must consider the potential impact of one choice at each step on the overall synthetic route. To address these challenges, we proposed SynthLens, a visual analytics system to facilitate the iterative construction of synthetic routes by exploring multiple possibilities for synthetic reactions at each step of construction. Specifically, we have introduced a tree-form visualization in SynthLens to compare and evaluate all the explored routes at various exploration steps, considering both the exploration step and multiple criteria. Our system empowers researchers to consider their construction process comprehensively, guiding them toward promising exploration directions to complete the synthetic route. We validated the usability and effectiveness of SynthLens through a quantitative evaluation and expert interviews, highlighting its role in facilitating the design process of synthetic routes. Finally, we discussed the insights of SynthLens to inspire other multi-criteria decision-making scenarios with visual analytics.
