ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, Simone Sciabola, Gianni De Fabritiis
TL;DR
ACEGEN addresses the challenge of efficiently exploring vast chemical space for drug design by delivering a modular toolkit built on TorchRL that combines reinforcement-learning agents with language-model–based molecular generators. It supports multiple CLM architectures, flexible scoring via MolScore/MolOpt, and constrained sampling modes (e.g., PromptSMILES, scaffold decoration), enabling de-novo, decorative, and fragment-linking generation. Through benchmarking on MolOpt, ablation studies of REINVENT components, and case studies including 5-HT$_{2A}$ docking and scaffold-constrained generation, the work provides practical guidance on reward design, regularization, and algorithm choice. The open-source ACEGEN platform, curated datasets, and detailed benchmarks promote reproducibility and accelerate adoption of RL-based drug discovery approaches in real-world pipelines.
Abstract
In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties. However, striking a balance between capabilities, flexibility, reliability, and efficiency remains challenging due to the complexity of advanced RL algorithms and the significant reliance on specialized code. In this work, we introduce ACEGEN, a comprehensive and streamlined toolkit tailored for generative drug design, built using TorchRL, a modern RL library that offers thoroughly tested reusable components. We validate ACEGEN by benchmarking against other published generative modeling algorithms and show comparable or improved performance. We also show examples of ACEGEN applied in multiple drug discovery case studies. ACEGEN is accessible at \url{https://github.com/acellera/acegen-open} and available for use under the MIT license.
