SimLM: Can Language Models Infer Parameters of Physical Systems?
Sean Memery, Mirella Lapata, Kartic Subr
TL;DR
The paper investigates whether Large Language Models can infer physical system parameters from observations, focusing on an inverse-physics task to estimate $(h,v)$ so the third bounce lands near a target. It introduces SimLM, a simulator-augmented prompting approach that interleaves physics simulation feedback with reasoning and self-critique, iterating up to $N=5$ times and leveraging past successful exemplars. In 2D experiments, SimLM improves over baseline CoT, with larger gains on harder, uneven terrains, and relative error dropping below $1$ in many cases; however, in a higher-dimensional 3D billiards task, all models show limited success and only marginal gains from simulation. The work demonstrates that grounding LLMs with physical simulators can enhance physics reasoning in 2D but also highlights current limits for complex 3D scenarios, pointing to simulator-grounded reasoning as a promising direction with needed advances for high-dimensional parameter inference.
Abstract
Several machine learning methods aim to learn or reason about complex physical systems. A common first-step towards reasoning is to infer system parameters from observations of its behavior. In this paper, we investigate the performance of Large Language Models (LLMs) at performing parameter inference in the context of physical systems. Our experiments suggest that they are not inherently suited to this task, even for simple systems. We propose a promising direction of exploration, which involves the use of physical simulators to augment the context of LLMs. We assess and compare the performance of different LLMs on a simple example with and without access to physical simulation.
