Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
Lianhao Zhou, Hongyi Ling, Cong Fu, Yepeng Huang, Michael Sun, Wendi Yu, Xiaoxuan Wang, Xiner Li, Xingyu Su, Junkai Zhang, Xiusi Chen, Chenxing Liang, Xiaofeng Qian, Heng Ji, Wei Wang, Marinka Zitnik, Shuiwang Ji
TL;DR
The paper argues that large language model–based autonomous agents can orchestrate scientists, language, code, and physics to accelerate the scientific discovery lifecycle—from hypothesis discovery through experimental design and execution to result analysis and refinement. It introduces an information-theoretic framework centered on entropy, verifiability, and dissipation, and proposes a five-level autonomy ladder to measure agent capability across discovery phases. It surveys knowledge extraction, hypothesis generation, experimental design/execution, and result analysis, and discusses tool use and tool creation as core operational modes, including domain-specific agents and multi-agent collaboration. The work also addresses challenges in agentic reinforcement learning, environment interaction with physical tools, and the role of serendipity, offering directions for building more robust, generalizable, and adaptive scientific agents with broad practical impact across disciplines.
Abstract
Computing has long served as a cornerstone of scientific discovery. Recently, a paradigm shift has emerged with the rise of large language models (LLMs), introducing autonomous systems, referred to as agents, that accelerate discovery across varying levels of autonomy. These language agents provide a flexible and versatile framework that orchestrates interactions with human scientists, natural language, computer language and code, and physics. This paper presents our view and vision of LLM-based scientific agents and their growing role in transforming the scientific discovery lifecycle, from hypothesis discovery, experimental design and execution, to result analysis and refinement. We critically examine current methodologies, emphasizing key innovations, practical achievements, and outstanding limitations. Additionally, we identify open research challenges and outline promising directions for building more robust, generalizable, and adaptive scientific agents. Our analysis highlights the transformative potential of autonomous agents to accelerate scientific discovery across diverse domains.
