WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making
Guillaume Levy, Cedric Colas, Pierre-Yves Oudeyer, Thomas Carta, Clement Romac
TL;DR
WorldLLM tackles grounded world modeling for LLMs by injecting natural-language hypotheses into a forward-model prompt and iteratively refining these hypotheses through Bayesian inference, while driving informative data collection with curiosity-driven RL. The framework couples a Statistician (LLM-based forward model), a Scientist (hypothesis generator via Metropolis-like Bayesian updates), and an Experimenter (oracle or RL-driven data collector) to autonomously improve predictions without gradient-based LLM fine-tuning. In Playground-Text experiments, WorldLLM improves predictive accuracy and yields interpretable theories of environment dynamics, outperforming vanilla prompting and providing a principled alternative to fine-tuning. The study demonstrates the potential of combining in-context learning, theory induction, and intrinsic motivation to ground LLMs in domain-specific dynamics, while revealing challenges in generalization that guide future improvements in priors, hypothesis abstraction, and scalable architectures.
Abstract
Large Language Models (LLMs) possess general world knowledge but often struggle to generate precise predictions in structured, domain-specific contexts such as simulations. These limitations arise from their inability to ground their broad, unstructured understanding in specific environments. To address this, we present WorldLLM, a framework that enhances LLM-based world modeling by combining Bayesian inference and autonomous active exploration with reinforcement learning. WorldLLM leverages the in-context learning abilities of LLMs to guide an LLM-based world model's predictions using natural language hypotheses given in its prompt. These hypotheses are iteratively refined through a Bayesian inference framework that leverages a second LLM as the proposal distribution given collected evidence. This evidence is collected using a curiosity-driven reinforcement learning policy that explores the environment to find transitions with a low log-likelihood under our LLM-based predictive model using the current hypotheses. By alternating between refining hypotheses and collecting new evidence, our framework autonomously drives continual improvement of the predictions. Our experiments demonstrate the effectiveness of WorldLLM in a textual game environment that requires agents to manipulate and combine objects. The framework not only enhances predictive accuracy, but also generates human-interpretable theories of environment dynamics.
