AutoSciLab: A Self-Driving Laboratory For Interpretable Scientific Discovery
Saaketh Desai, Sadhvikas Addamane, Jeffrey Y. Tsao, Igal Brener, Laura P. Swiler, Remi Dingreville, Prasad P. Iyer
TL;DR
AutoSciLab addresses the bottleneck of human-intuition-driven discovery in high-dimensional scientific spaces by integrating a VAE-based generative model, an active-learning loop with Gaussian processes, a directional autoencoder to embed prior physics into a low-dimensional latent space, and a neural-network-based equation learner to express results as human-readable equations. The framework demonstrates that autonomous exploration can rediscover classical physics (projectile motion and Ising magnetization) and uncover novel principles in nanophotonics, while providing interpretable mappings between latent variables and physical quantities. It reports substantial reductions in experimental effort, increased diversity of candidate experiments, and actionable, symbolic insights that tie latent variables to measurable outcomes, positioning AutoSciLab as a scalable path toward automated, interpretable scientific discovery in noisy, expensive domains. The work also discusses limitations, including dependence on curated priors and training data, and outlines a generalizable workflow poised for broad application in physical sciences and engineering.
Abstract
Advances in robotic control and sensing have propelled the rise of automated scientific laboratories capable of high-throughput experiments. However, these laboratories are currently limited by human intuition in their ability to efficiently design and interpret experiments in high-dimensional spaces, which throttles scientific discovery. We present AutoSciLab, a machine learning framework for driving autonomous scientific experiments that acts as a surrogate researcher for scientific discovery in high-dimensional spaces. AutoSciLab autonomously follows the scientific method in four steps: (i) generating high-dimensional experiments (x \in R^D) using a variational autoencoder; (ii) selecting optimal experiments by forming hypotheses using active learning; (iii) distilling the experimental results to discover relevant low-dimensional latent variables (z \in R^d, with d << D) with a 'directional autoencoder'; and (iv) learning a human-interpretable equation connecting the discovered latent variables with a quantity of interest (y = f(z)), using a neural network equation learner. We validate the generalizability of AutoSciLab by rediscovering (a) the principles of projectile motion and (b) the phase transitions within the spin states of the Ising model (an NP-hard problem). Applied to an open-ended nanophotonics challenge, AutoSciLab uncovers a fundamentally novel method for directing incoherent light emission that surpasses the current state of the art (Iyer et al. 2023b, 2020).
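To make step (ii) concrete, the following is a minimal sketch of an active-learning loop on a toy version of the projectile-motion problem. It is not the authors' implementation. The abstract does not specify the surrogate kernel, the acquisition function, or the experiment budget, so the RBF kernel, the upper-confidence-bound (UCB) rule, the fixed candidate grid (standing in for VAE-generated experiments), and all function names and parameter values below are assumptions chosen for illustration.

```python
import numpy as np

G = 9.81  # gravitational acceleration in m/s^2


def projectile_range(theta, v0=10.0):
    """Toy 'experiment': range on flat ground for launch angle theta (radians)."""
    return v0**2 * np.sin(2.0 * theta) / G


def rbf_kernel(a, b, length_scale=0.3):
    """Unit-variance squared-exponential kernel between two 1-D input arrays."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)


def gp_posterior(x_train, y_train, x_query, jitter=1e-4):
    """Posterior mean and standard deviation of a zero-mean GP at x_query."""
    k_tt = rbf_kernel(x_train, x_train) + jitter * np.eye(len(x_train))
    k_tq = rbf_kernel(x_train, x_query)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_tq.T @ alpha
    v = np.linalg.solve(chol, k_tq)
    var = np.clip(1.0 - np.sum(v**2, axis=0), 0.0, None)
    return mean, np.sqrt(var)


def active_learning_loop(n_iterations=15, kappa=2.0):
    """Pick launch angles one at a time with a UCB rule (an assumed acquisition)."""
    candidates = np.linspace(0.0, np.pi / 2.0, 181)  # assumed candidate pool
    visited = np.zeros(len(candidates), dtype=bool)
    x_train = np.array([0.1, 1.4])  # two arbitrary seed experiments
    y_train = projectile_range(x_train)
    for _ in range(n_iterations):
        # Standardize targets so the unit-variance GP prior is reasonable.
        y_scaled = (y_train - y_train.mean()) / (y_train.std() + 1e-9)
        mean, std = gp_posterior(x_train, y_scaled, candidates)
        ucb = mean + kappa * std
        ucb[visited] = -np.inf  # never repeat an experiment
        idx = int(np.argmax(ucb))
        visited[idx] = True
        x_train = np.append(x_train, candidates[idx])
        y_train = np.append(y_train, projectile_range(candidates[idx]))
    best_theta = float(x_train[np.argmax(y_train)])
    return best_theta, x_train, y_train
```

Calling `best_theta, xs, ys = active_learning_loop()` returns the best launch angle found together with the full history of queried angles and measured ranges. For this toy objective the true optimum is 45 degrees, which gives a simple check that the loop is exploring sensibly before moving to a real, noisy, high-dimensional experiment.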
