Scalable Agentic Reasoning for Designing Biologics Targeting Intrinsically Disordered Proteins
Matthew Sinclair, Moeen Meigooni, Archit Vasan, Ozan Gokdemir, Xinran Lian, Heng Ma, Yadu Babuji, Alexander Brace, Khalid Hossain, Carlo Siebenschuh, Thomas Brettin, Kyle Chard, Christopher Henry, Venkatram Vishwanath, Rick L. Stevens, Ian T. Foster, Arvind Ramanathan
TL;DR
IDPs pose a major drug design challenge due to conformational heterogeneity and lack of stable pockets. StructBioReasoner introduces a scalable tournament-based multi-agent system that autonomously designs biologics for IDPs by integrating retrieval-augmented reasoning, structure prediction, MD simulation, and binder design under the Academy HPC middleware. The framework achieves high design success on Der f 21 and reveals multiple binding modes for NMNAT-2, including NMNAT-2:p53, demonstrating competitive performance with human-guided workflows. The work establishes a path toward exascale autonomous discovery of IDP-targeting therapeutics, highlighting solid scaling on Aurora and outlining concrete directions to address I/O bottlenecks for future large-scale deployments.
Abstract
Intrinsically disordered proteins (IDPs) represent crucial therapeutic targets due to their significant role in disease -- approximately 80\% of cancer-related proteins contain long disordered regions -- but their lack of stable secondary/tertiary structures makes them "undruggable". While recent computational advances, such as diffusion models, can design high-affinity IDP binders, translating these to practical drug discovery requires autonomous systems capable of reasoning across complex conformational ensembles and orchestrating diverse computational tools at scale.To address this challenge, we designed and implemented StructBioReasoner, a scalable multi-agent system for designing biologics that can be used to target IDPs. StructBioReasoner employs a novel tournament-based reasoning framework where specialized agents compete to generate and refine therapeutic hypotheses, naturally distributing computational load for efficient exploration of the vast design space. Agents integrate domain knowledge with access to literature synthesis, AI-structure prediction, molecular simulations, and stability analysis, coordinating their execution on HPC infrastructure via an extensible federated agentic middleware, Academy. We benchmark StructBioReasoner across Der f 21 and NMNAT-2 and demonstrate that over 50\% of 787 designed and validated candidates for Der f 21 outperformed the human-designed reference binders from literature, in terms of improved binding free energy. For the more challenging NMNAT-2 protein, we identified three binding modes from 97,066 binders, including the well-studied NMNAT2:p53 interface. Thus, StructBioReasoner lays the groundwork for agentic reasoning systems for IDP therapeutic discovery on Exascale platforms.
