Do Two AI Scientists Agree?

Xinghong Fu; Ziming Liu; Max Tegmark

TL;DR

This work introduces MASS, a multi-physics AI framework that learns a scalar function $S$ per physical system whose derivatives define dynamics, allowing a single network to capture multiple theories across diverse systems. Through controlled experiments on SHO, pendulum, Kepler, and synthetic relativistic systems, MASS reveals that individual AI scientists can learn diverse explanations yet converge to a common underlying theory as data complexity increases, with a notable shift from Hamiltonian to Lagrangian representations in more complex regimes. The study demonstrates that multiple seeds (AI scientists) generally agree on the learned theory at the level of activations, even when exact weights differ, and shows that the Lagrangian description emerges as the dominant, unifying form in richer theory spaces. Extensions to higher dimensions, including the double pendulum and n-body problems, indicate MASS’s potential for interpretable, scalable AI-driven discovery of physical laws, albeit with computational considerations tied to Hessian inverses and training stability.

Abstract

When two AI models are trained on the same scientific task, do they learn the same theory or two different theories? Throughout the history of science, we have witnessed the rise and fall of theories driven by experimental validation or falsification: many theories may co-exist when experimental data is lacking, but the space of surviving theories becomes more constrained as more experimental data becomes available. We show that the same story holds for AI scientists. As more systems are included in the training data, AI scientists tend to converge in the theories they learn, although they sometimes form distinct groups corresponding to different theories. To mechanistically interpret what theories AI scientists learn and to quantify their agreement, we propose MASS, Hamiltonian-Lagrangian neural networks as AI Scientists, trained on standard problems in physics, aggregating training results across many seeds that simulate different configurations of AI scientists. Our findings suggest that AI scientists switch from learning a Hamiltonian theory in simple setups to a Lagrangian formulation when more complex systems are introduced. We also observe strong seed dependence of the training dynamics and final learned weights, which controls the rise and fall of relevant theories. Finally, we demonstrate that our neural networks not only aid interpretability but can also be applied to higher-dimensional problems.
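To make the Hamiltonian versus Lagrangian distinction concrete, the sketch below shows two textbook ways of turning a scalar function into dynamics for a one-dimensional simple harmonic oscillator with unit mass and unit spring constant. It is an illustration only, not the MASS implementation: the helper names (`partial`, `hamiltonian_flow`, `lagrangian_accel`), the finite-difference derivatives, and the test point are all assumptions made for this example. In one dimension the Hessian of the Lagrangian with respect to velocity is a single number, so the Hessian inverse reduces to a division.

```python
# Illustrative sketch only: hypothetical helpers, not the paper's code.

def partial(f, args, i, h=1e-4):
    """Central finite-difference estimate of df/d(args[i])."""
    up = list(args)
    dn = list(args)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)


def hamiltonian_flow(H, q, p):
    """Hamilton's equations: dq/dt = dH/dp, dp/dt = -dH/dq."""
    dq_dt = partial(H, (q, p), 1)
    dp_dt = -partial(H, (q, p), 0)
    return dq_dt, dp_dt


def lagrangian_accel(L, q, v):
    """Euler-Lagrange equation solved for the acceleration in 1D:
    a = (dL/dq - d2L/dvdq * v) / (d2L/dv2).
    The division plays the role of the Hessian inverse."""
    dL_dq = partial(L, (q, v), 0)

    def dL_dv(q_, v_):
        return partial(L, (q_, v_), 1)

    d2L_dv2 = partial(dL_dv, (q, v), 1)
    d2L_dvdq = partial(dL_dv, (q, v), 0)
    return (dL_dq - d2L_dvdq * v) / d2L_dv2


def sho_H(q, p):
    """Hamiltonian of the unit-mass, unit-stiffness oscillator."""
    return 0.5 * p**2 + 0.5 * q**2


def sho_L(q, v):
    """Lagrangian of the same oscillator."""
    return 0.5 * v**2 - 0.5 * q**2


# Arbitrary test point; with unit mass, momentum equals velocity.
q0, v0 = 0.3, -0.7
dq, dp = hamiltonian_flow(sho_H, q0, v0)
acc = lagrangian_accel(sho_L, q0, v0)
print("Hamiltonian reading:", dq, dp)
print("Lagrangian reading: ", acc)
```

Both readings describe the same motion here: the Hamiltonian flow gives a rate of change of momentum equal to the acceleration obtained from the Lagrangian, up to finite-difference error. In higher dimensions the division becomes a linear solve against the velocity Hessian, which is where the computational cost mentioned in the TL;DR arises.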
