Towards autonomous quantum physics research using LLM agents with access to intelligent tools
Sören Arlt, Xuemei Gu, Mario Krenn
TL;DR
Problem: Automating the generation of novel research directions and their experimental realization remains a major challenge in science. Approach: AI-Mandel uses two coupled LLM-agent systems with access to literature and PyTheus to autonomously propose quantum-physics ideas and translate them into executable experiment configurations. Findings: The system generated 187 ideas, with 184 implemented at least once and 804 total implementations (739 successful), including seven highlighted concepts and two independent publishable papers. Significance: This work demonstrates a concrete path toward AI-driven scientific discovery, while outlining practical challenges and future directions toward more general, interpretable, and autonomous artificial scientists.
Abstract
Artificial intelligence (AI) is used in numerous fields of science, yet the initial research questions and targets are still almost always provided by human researchers. AI-generated creative ideas in science are rare and often vague, so that it remains a human task to execute them. Automating idea generation and implementation in one coherent system would significantly shift the role of humans in the scientific process. Here we present AI-Mandel, an LLM agent that can generate and implement ideas in quantum physics. AI-Mandel formulates ideas from the literature and uses a domain-specific AI tool to turn them into concrete experiment designs that can readily be implemented in laboratories. The generated ideas by AI-Mandel are often scientifically interesting - for two of them we have already written independent scientific follow-up papers. The ideas include new variations of quantum teleportation, primitives of quantum networks in indefinite causal orders, and new concepts of geometric phases based on closed loops of quantum information transfer. AI-Mandel is a prototypical demonstration of an AI physicist that can generate and implement concrete, actionable ideas. Building such a system is not only useful to accelerate science, but it also reveals concrete open challenges on the path to human-level artificial scientists.
