Curious Causality-Seeking Agents Learn Meta Causal World
Zhiyu Zhao, Haoxuan Li, Haifeng Zhang, Jun Wang, Francesco Faccio, Jürgen Schmidhuber, Mengyue Yang
TL;DR
This work tackles the instability of traditional world models under distributional shifts by introducing a Meta-Causal Graph (MG) that encodes multiple context-specific causal subgraphs activated by latent meta states. A Curious Causality-Seeking Agent actively intervenes to discover meta states and their causal structures, using interventions guided by curiosity rewards (e.g., edge-entropy) and a VQ-VAE-based representation to learn the MG. The authors establish identifiability results for meta states and causal subgraphs under mixed data and interventional conditions, and they validate the approach on synthetic tasks and a robot-arm manipulation task, showing improved prediction accuracy and downstream performance compared to static baselines. This framework advances open-ended world modeling by combining active causal discovery with minimal, robust representations and principled intervention strategies, enabling better generalization across unseen contexts.
Abstract
When building a world model, a common assumption is that the environment has a single, unchanging underlying causal rule, like applying Newton's laws to every situation. In reality, what appears as a drifting causal mechanism is often the manifestation of a fixed underlying mechanism seen through a narrow observational window. This brings about a problem that, when building a world model, even subtle shifts in policy or environment states can alter the very observed causal mechanisms. In this work, we introduce the \textbf{Meta-Causal Graph} as world models, a minimal unified representation that efficiently encodes the transformation rules governing how causal structures shift across different latent world states. A single Meta-Causal Graph is composed of multiple causal subgraphs, each triggered by meta state, which is in the latent state space. Building on this representation, we introduce a \textbf{Causality-Seeking Agent} whose objectives are to (1) identify the meta states that trigger each subgraph, (2) discover the corresponding causal relationships by agent curiosity-driven intervention policy, and (3) iteratively refine the Meta-Causal Graph through ongoing curiosity-driven exploration and agent experiences. Experiments on both synthetic tasks and a challenging robot arm manipulation task demonstrate that our method robustly captures shifts in causal dynamics and generalizes effectively to previously unseen contexts.
