Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

Amirtha Varshini A S; Duminda S. Ranasinghe; Hok Hei Tam

Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

Amirtha Varshini A S, Duminda S. Ranasinghe, Hok Hei Tam

TL;DR

The paper addresses the opacity of decision policies in Generative Flow Networks (GFlowNets) applied to drug design by introducing an interpretability toolkit for SynFlowNet trained with the reward $QED$. It combines gradient-based saliency, counterfactual perturbations, sparse autoencoders, and motif probes to connect atomic-level decisions to interpretable chemical concepts. The results show atom-level saliency aligns with chemically meaningful regions, latent factors capture polarity and size with strong $R^2$ correlations, and motif probes reveal linearly decodable functional groups, collectively enabling mechanistic insight into GFlowNet decision-making. These findings support transparent, controllable molecular design and point toward conditioning generative policies on interpretable physicochemical axes for improved alignment with medicinal chemistry reasoning.

Abstract

Generative Flow Networks, or GFlowNets, offer a promising framework for molecular design, but their internal decision policies remain opaque. This limits adoption in drug discovery, where chemists require clear and interpretable rationales for proposed structures. We present an interpretability framework for SynFlowNet, a GFlowNet trained on documented chemical reactions and purchasable starting materials that generates both molecules and the synthetic routes that produce them. Our approach integrates three complementary components. Gradient based saliency combined with counterfactual perturbations identifies which atomic environments influence reward and how structural edits change molecular outcomes. Sparse autoencoders reveal axis aligned latent factors that correspond to physicochemical properties such as polarity, lipophilicity, and molecular size. Motif probes show that functional groups including aromatic rings and halogens are explicitly encoded and linearly decodable from the internal embeddings. Together, these results expose the chemical logic inside SynFlowNet and provide actionable and mechanistic insight that supports transparent and controllable molecular design.

Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

TL;DR

The paper addresses the opacity of decision policies in Generative Flow Networks (GFlowNets) applied to drug design by introducing an interpretability toolkit for SynFlowNet trained with the reward

. It combines gradient-based saliency, counterfactual perturbations, sparse autoencoders, and motif probes to connect atomic-level decisions to interpretable chemical concepts. The results show atom-level saliency aligns with chemically meaningful regions, latent factors capture polarity and size with strong

correlations, and motif probes reveal linearly decodable functional groups, collectively enabling mechanistic insight into GFlowNet decision-making. These findings support transparent, controllable molecular design and point toward conditioning generative policies on interpretable physicochemical axes for improved alignment with medicinal chemistry reasoning.

Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

TL;DR

Abstract

Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)