FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization
Yuto Suzuki, Paul Awolade, Daniel V. LaBarbera, Farnoush Banaei-Kashani
TL;DR
FRAGMENTA tackles data-scarce drug lead optimization by marrying LVSEF, a fragment-based generator that treats fragmentation as a vocabulary selection problem optimized via dynamic Q-learning, with an agentic AI system that automates expert feedback interpretation and model tuning. The two pillars create a closed-loop workflow that substantially increases high-quality lead candidates and reduces reliance on human engineers. Real-world cancer-target results show nearly double the number of docking-worthy molecules with a fully human-in-the-loop setup and strong performance gains even in fully autonomous Agent-Agent configurations. The work demonstrates a scalable path toward automated, expert-aligned molecular design, with practical impact for rapid lead optimization in drug discovery.
Abstract
Molecule generation using generative AI is vital for drug discovery, yet class-specific datasets often contain fewer than 100 training examples. While fragment-based models handle limited data better than atom-based approaches, existing heuristic fragmentation limits diversity and misses key fragments. Additionally, model tuning typically requires slow, indirect collaboration between medicinal chemists and AI engineers. We introduce FRAGMENTA, an end-to-end framework for drug lead optimization comprising: 1) a novel generative model that reframes fragmentation as a "vocabulary selection" problem, using dynamic Q-learning to jointly optimize fragmentation and generation; and 2) an agentic AI system that refines objectives via conversational feedback from domain experts. This system removes the AI engineer from the loop and progressively learns domain knowledge to eventually automate tuning. In real-world cancer drug discovery experiments, FRAGMENTA's Human-Agent configuration identified nearly twice as many high-scoring molecules as baselines. Furthermore, the fully autonomous Agent-Agent system outperformed traditional Human-Human tuning, demonstrating the efficacy of agentic tuning in capturing expert intent.
