Table of Contents
Fetching ...

Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds

Gaël Gendron, Jože M. Rožanec, Michael Witbrock, Gillian Dobbie

TL;DR

The paper presents Causal Cartographer, a dual-framework that combines a graph retrieval-augmented causal extraction component with a step-by-step counterfactual reasoning agent to enable robust reasoning over counterfactual worlds. By constructing CausalWorld from real-world news (e.g., oil-related events), it creates a repository of causal knowledge with 975 nodes and 1337 edges to guide counterfactual inference under causal constraints. A dedicated CTG-Reason agent performs abduction-intervention-prediction in a way that respects causal structure, achieving competitive accuracy while greatly reducing context size and inference cost compared to standard chain-of-thought baselines. The approach demonstrates practical benefits for evaluating and improving LLMs on real-world counterfactual tasks and highlights potential societal impacts, including mitigations against bias and the importance of data reliability. Overall, the work advances causal reasoning in AI by linking explicit causal knowledge extraction with efficient, structured counterfactual inference.

Abstract

Causal world models are systems that can answer counterfactual questions about an environment of interest, i.e. predict how it would have evolved if an arbitrary subset of events had been realized differently. It requires understanding the underlying causes behind chains of events and conducting causal inference for arbitrary unseen distributions. So far, this task eludes foundation models, notably large language models (LLMs), which do not have demonstrated causal reasoning capabilities beyond the memorization of existing causal relationships. Furthermore, evaluating counterfactuals in real-world applications is challenging since only the factual world is observed, limiting evaluation to synthetic datasets. We address these problems by explicitly extracting and modeling causal relationships and propose the Causal Cartographer framework. First, we introduce a graph retrieval-augmented generation agent tasked to retrieve causal relationships from data. This approach allows us to construct a large network of real-world causal relationships that can serve as a repository of causal knowledge and build real-world counterfactuals. In addition, we create a counterfactual reasoning agent constrained by causal relationships to perform reliable step-by-step causal inference. We show that our approach can extract causal knowledge and improve the robustness of LLMs for causal reasoning tasks while reducing inference costs and spurious correlations.

Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds

TL;DR

The paper presents Causal Cartographer, a dual-framework that combines a graph retrieval-augmented causal extraction component with a step-by-step counterfactual reasoning agent to enable robust reasoning over counterfactual worlds. By constructing CausalWorld from real-world news (e.g., oil-related events), it creates a repository of causal knowledge with 975 nodes and 1337 edges to guide counterfactual inference under causal constraints. A dedicated CTG-Reason agent performs abduction-intervention-prediction in a way that respects causal structure, achieving competitive accuracy while greatly reducing context size and inference cost compared to standard chain-of-thought baselines. The approach demonstrates practical benefits for evaluating and improving LLMs on real-world counterfactual tasks and highlights potential societal impacts, including mitigations against bias and the importance of data reliability. Overall, the work advances causal reasoning in AI by linking explicit causal knowledge extraction with efficient, structured counterfactual inference.

Abstract

Causal world models are systems that can answer counterfactual questions about an environment of interest, i.e. predict how it would have evolved if an arbitrary subset of events had been realized differently. It requires understanding the underlying causes behind chains of events and conducting causal inference for arbitrary unseen distributions. So far, this task eludes foundation models, notably large language models (LLMs), which do not have demonstrated causal reasoning capabilities beyond the memorization of existing causal relationships. Furthermore, evaluating counterfactuals in real-world applications is challenging since only the factual world is observed, limiting evaluation to synthetic datasets. We address these problems by explicitly extracting and modeling causal relationships and propose the Causal Cartographer framework. First, we introduce a graph retrieval-augmented generation agent tasked to retrieve causal relationships from data. This approach allows us to construct a large network of real-world causal relationships that can serve as a repository of causal knowledge and build real-world counterfactuals. In addition, we create a counterfactual reasoning agent constrained by causal relationships to perform reliable step-by-step causal inference. We show that our approach can extract causal knowledge and improve the robustness of LLMs for causal reasoning tasks while reducing inference costs and spurious correlations.

Paper Structure

This paper contains 38 sections, 1 theorem, 8 equations, 25 figures.

Key Result

Theorem 1

Suppose that a causal blanket $\mathcal{B}_c$ is K-matched with $\mathcal{O}_o$ over a variable $T$ and that $\mathcal{O}_o \setminus \mathcal{O}_s$ forms a causal blanket over each variable of $\mathcal{O}_s$. Then, observing $\mathcal{B}_c$ or observing $\mathcal{O}_o \setminus \mathcal{O}_s$ and

Figures (25)

  • Figure 1: Counterfactual twin graph for three endogenous variables $\{\textcolor{Cyan}{X}, \textcolor{Cyan}{Y}, \textcolor{Cyan}{Z}\}$ and two exogenous variables $\{\textcolor{red}{U},\textcolor{red}{V}\}$, under an intervention over $\textcolor{Orange}{X}$. A counterfactual world is identical to the factual world, except for the intervention and its effects. Exogenous variables are shared by both worlds, connecting the two graphs. $\textcolor{Orange}{X}$ is being intervened upon and so has no incoming edges. The counterfactual value of $\textcolor{Green}{Y}$ given by $P(\textcolor{Green}{Y} \mid \textcolor{Orange}{do(X)},\textcolor{Cyan}{X'},\textcolor{Cyan}{Y'})$ is obtained by estimating the value of $\textcolor{red}{U}$ from the factual observations, then deducing $\textcolor{Green}{Z}$ from $\textcolor{red}{U}$ (left unchanged) and $\textcolor{Green}{Y}$ from $\textcolor{Green}{Z}$ and $\textcolor{Orange}{X}$.
  • Figure 2: Overview of the Causal Cartographer Extraction agent (CTG-Extract). (a) Illustration of the extraction pipeline. The title, content, and metadata from a news source are provided to the agent context. It then generates causal variables and their relationships and adds them to the causal world model. A graphRAG system further compares article data and agent node suggestions and adds to the agent context the nodes and causal relationships that seem related. (b) Example of a node extracted with the method. Its name, description, type, possible values, and the worlds it appears in are shown.
  • Figure 3: CausalWorld graph structure. Colors show the cluster in which the node belongs. Node size is based on the number of worlds a node appears in: the more a node appears, the bigger it is on the visualization. Labels correspond to the name of the most representative node of the cluster.
  • Figure 4: Illustration of a direct causal path in the CausalWorld graph. Nodes are blue boxes. Arrows represent causal dependencies. The description of the dependency is shown in dashed red boxes. Note that all nodes except for the root can have additional causal parents not shown in the chain and that the strength and function related to the causal relationships are not shown.
  • Figure 5: Illustration of a cycle in the CausalWorld graph. The legend is the same as in Figure \ref{['fig:causal-chain-ex']}.
  • ...and 20 more figures

Theorems & Definitions (4)

  • Definition 1: Causal Blanket
  • Definition 2: K-Matching
  • Theorem 1: K-Matching Equivalence
  • proof