Agentic Exploration of Physics Models
Maximilian Nägele, Florian Marquardt
TL;DR
SciExplorer demonstrates that a generalist LLM agent, empowered by tool use and external memory, can autonomously drive the heuristic loop of scientific discovery across mechanical, wave, and quantum domains without task-specific tuning. By integrating a minimal system prompt with flexible Python-code execution, plotting, and domain simulators, the agent designs experiments, analyzes results, and forms hypotheses to recover governing equations and Hamiltonians with high fidelity. Across mechanical dynamics, field/wave evolution, and quantum spin models, the approach yields near-perfect fits and robust out-of-sample validations, suggesting strong potential for domain-general scientific exploration and rapid hypothesis testing in experimental contexts. The work points to broad applicability beyond physics, enabling automated discovery, phase-diagram mapping, and control optimization in unknown or partially known systems.
Abstract
The process of scientific discovery relies on an interplay of observations, analysis, and hypothesis generation. Machine learning is increasingly being adopted to address individual aspects of this process. However, it remains an open challenge to fully automate the heuristic, iterative loop required to discover the laws of an unknown system by exploring it through experiments and analysis, without tailoring the approach to the specifics of a given task. Here, we introduce SciExplorer, an agent that leverages large language model tool-use capabilities to enable exploration of systems without any domain-specific blueprints, and apply it to physical systems that are initially unknown to the agent. We test SciExplorer on a broad set of models spanning mechanical dynamical systems, wave evolution, and quantum many-body physics. Despite using a minimal set of tools, primarily based on code execution, we observe impressive performance on tasks such as recovering equations of motion from observed dynamics and inferring Hamiltonians from expectation values. The demonstrated effectiveness of this setup opens the door towards similar scientific exploration in other domains, without the need for finetuning or task-specific instructions.
