Genetic-guided GFlowNets for Sample Efficient Molecular Optimization
Hyeonah Kim, Minsu Kim, Sanghyeok Choi, Jinkyoo Park
TL;DR
This work tackles the challenge of sample-efficient de novo molecular optimization where reward evaluations are expensive. It introduces Genetic GFN, which distills domain knowledge from genetic algorithms into a deep generative policy via off-policy GFlowNets, enhanced by unsupervised pretraining and a KL-regularized TB objective. The method combines SMILES-based policy generation with graph-based genetic search, enabling diverse, high-reward molecule generation and robust policy updates through trajectory balance with rank-based replay. Empirically, Genetic GFN achieves state-of-the-art performance on the Practical Molecular Optimization benchmark and significantly reduces reward calls in SARS-CoV-2 inhibitor design, highlighting its practical impact for drug discovery and materials design through improved sample efficiency.
Abstract
The challenge of discovering new molecules with desired properties is crucial in domains like drug discovery and material design. Recent advances in deep learning-based generative methods have shown promise but face the issue of sample efficiency due to the computational expense of evaluating the reward function. This paper proposes a novel algorithm for sample-efficient molecular optimization by distilling a powerful genetic algorithm into deep generative policy using GFlowNets training, the off-policy method for amortized inference. This approach enables the deep generative policy to learn from domain knowledge, which has been explicitly integrated into the genetic algorithm. Our method achieves state-of-the-art performance in the official molecular optimization benchmark, significantly outperforming previous methods. It also demonstrates effectiveness in designing inhibitors against SARS-CoV-2 with substantially fewer reward calls.
