FREED++: Improving RL Agents for Fragment-Based Molecule Generation by Thorough Reproduction
Alexander Telepov, Artem Tsypin, Kuzma Khrabrov, Sergey Yakukhnov, Pavel Strashnov, Petr Zhilyaev, Egor Rumiantsev, Daniel Ezhov, Manvel Avetisian, Olga Popova, Artur Kadurin
TL;DR
This work assesses protein-conditioned fragment-based molecule generation via RL, focusing on reproducing and fixing the FREED framework. It identifies critical implementation bugs, proposes FFREED as a corrected baseline, and introduces FREED++ as a streamlined, faster variant with ablations. Across extensive experiments on multiple protein targets and fragment libraries, FREED++ consistently achieves superior docking-score performance compared with baselines, while offering better generalization and stability. The study emphasizes reproducibility, broader evaluation, and practical applicability to USP7 inhibitors, highlighting the importance of library design and robust RL components for drug discovery pipelines.
Abstract
A rational design of new therapeutic drugs aims to find a molecular structure with desired biological functionality, e.g., an ability to activate or suppress a specific protein via binding to it. Molecular docking is a common technique for evaluating protein-molecule interactions. Recently, Reinforcement Learning (RL) has emerged as a promising approach to generating molecules with the docking score (DS) as a reward. In this work, we reproduce, scrutinize and improve the recent RL model for molecule generation called FREED (arXiv:2110.01219). Extensive evaluation of the proposed method reveals several limitations and challenges despite the outstanding results reported for three target proteins. Our contributions include fixing numerous implementation bugs and simplifying the model while increasing its quality, significantly extending experiments, and conducting an accurate comparison with current state-of-the-art methods for protein-conditioned molecule generation. We show that the resulting fixed model is capable of producing molecules with superior docking scores compared to alternative approaches.
