Inducing Causal World Models in LLMs for Zero-Shot Physical Reasoning
Aditya Sharma, Ananya Gupta, Chengyu Wang, Chiamaka Adebayo, Jakub Kowalski
TL;DR
This work addresses the limitation that LLMs lack grounded physical understanding by introducing Causal World Model Induction (CWMI), which embeds a Causal Physics Module (CPM) inside a frozen LLM and trains it with a Causal Intervention Loss. The CPM acts as a latent-space physics engine, learning causal dynamics through state-prediction and counterfactual reasoning, enabling zero-shot physical QA that leverages internal simulation. On PIQA and the newly proposed PhysiCa-Bench, CWMI substantially outperforms baselines, with high CCS and low FSPA, demonstrating that explicit causal world modeling yields more robust, generalizable physical reasoning. The work highlights the value of architectural modularity (LLM as language interface, CPM as physics engine) and skillful use of multimodal data, offering practical gains for safe, reliable embodied AI systems.
Abstract
Large Language Models (LLMs), despite their advanced linguistic capabilities, fundamentally lack an intuitive understanding of physical dynamics, which limits their effectiveness in real-world scenarios that require causal reasoning. In this paper, we introduce Causal World Model Induction (CWMI), a novel framework designed to embed an explicit model of causal physics within an LLM. Our approach incorporates a dedicated Causal Physics Module (CPM) and a new training objective called Causal Intervention Loss, encouraging the model to learn cause-and-effect relationships from multimodal data. By training the model to predict the outcomes of hypothetical interventions instead of merely capturing statistical correlations, CWMI develops a robust internal representation of physical laws. Experimental results show that CWMI significantly outperforms state-of-the-art LLMs on zero-shot physical reasoning tasks, including the PIQA benchmark and our newly proposed PhysiCa-Bench dataset. These findings demonstrate that inducing a causal world model is a critical step toward more reliable and generalizable AI systems.
