Target Specific De Novo Design of Drug Candidate Molecules with Graph Transformer-based Generative Adversarial Networks
Atabey Ünlü, Elif Çevrim, Melih Gökay Yiğit, Ahmet Sarıgün, Hayriye Çelikbilek, Osman Bayram, Deniz Cansen Kahraman, Abdurrahman Olğaç, Ahmet Sureyya Rifaioğlu, Erden Banoğlu, Tunca Doğan
TL;DR
DrugGEN introduces a graph transformer-based GAN to enable target-centric de novo design of drug-like molecules, trained on ChEMBL-derived bioactive molecules and evaluated against AKT1 (and CDK2). The system uses real molecular graphs as generator input, enhanced edge-aware attention, and a Wasserstein GAN framework to generate inhibitors with favorable docking and drug-likeness properties. Comprehensive validation includes MOSES benchmarking, docking to AKT1/CDK2, DEEPScreen-based DTI predictions, dimensionality-reduction space analyses, and MD simulations, followed by synthesis and in vitro AKT1 inhibition with two compounds showing low micromolar activity. The work demonstrates target-centric generation within a scalable, open-source pipeline and discusses limitations of GAN training, outlining future directions toward broader targets, fragment-based construction, and integrated target features.
Abstract
Discovering novel drug candidate molecules is one of the most fundamental and critical steps in drug development. Generative deep learning models, which create synthetic data given a probability distribution, offer a high potential for designing de novo molecules. However, to be utilisable in real life drug development pipelines, these models should be able to design drug like and target centric molecules. In this study, we propose an end to end generative system, DrugGEN, for the de novo design of drug candidate molecules that interact with intended target proteins. The proposed method represents molecules as graphs and processes them via a generative adversarial network comprising graph transformer layers. The system is trained using a large dataset of drug like compounds and target specific bioactive molecules to design effective inhibitory molecules against the AKT1 protein, which is critically important in developing treatments for various types of cancer. We conducted molecular docking and dynamics to assess the target centric generation performance of the model, as well as attention score visualisation to examine model interpretability. In parallel, selected compounds were chemically synthesised and evaluated in the context of in vitro enzymatic assays, which identified two bioactive molecules that inhibited AKT1 at low micromolar concentrations. These results indicate that DrugGEN's de novo molecules have a high potential for interacting with the AKT1 protein at the level of its native ligands. Using the open access DrugGEN codebase, it is possible to easily train models for other druggable proteins, given a dataset of experimentally known bioactive molecules.
