Table of Contents
Fetching ...

Retrieving Classes of Causal Orders with Inconsistent Knowledge Bases

Federico Baldo, Simon Ferreira, Charles K. Assaad

Abstract

Traditional causal discovery methods often depend on strong, untestable assumptions, making them unreliable in real-world applications. In this context, Large Language Models (LLMs) have emerged as a promising alternative for extracting causal knowledge from text-based metadata, effectively consolidating domain expertise. However, LLMs are prone to hallucinations, necessitating strategies that account for these limitations. One effective approach is to use a consistency measure as a proxy of reliability. Moreover, LLMs do not clearly distinguish direct from indirect causal relationships, complicating the discovery of causal Directed Acyclic Graphs (DAGs), which are often sparse. This ambiguity is evident in the way informal sentences are formulated in various domains. For this reason, focusing on causal orders provides a more practical and direct task for LLMs. We propose a new method for deriving abstractions of causal orders that maximizes a consistency score obtained from an LLM. Our approach begins by computing pairwise consistency scores between variables, from which we construct a semi-complete partially directed graph that consolidates these scores into an abstraction. Using this structure, we identify both a maximally oriented partially directed acyclic graph and an optimal set of acyclic tournaments that maximize consistency across all configurations. We further demonstrate how both the abstraction and the class of causal orders can be used to estimate causal effects. We evaluate our method on a wide set of causal DAGs extracted from scientific literature in epidemiology and public health. Our results show that the proposed approach can effectively recover the correct causal order, providing a reliable and practical LLM-assisted causal framework.

Retrieving Classes of Causal Orders with Inconsistent Knowledge Bases

Abstract

Traditional causal discovery methods often depend on strong, untestable assumptions, making them unreliable in real-world applications. In this context, Large Language Models (LLMs) have emerged as a promising alternative for extracting causal knowledge from text-based metadata, effectively consolidating domain expertise. However, LLMs are prone to hallucinations, necessitating strategies that account for these limitations. One effective approach is to use a consistency measure as a proxy of reliability. Moreover, LLMs do not clearly distinguish direct from indirect causal relationships, complicating the discovery of causal Directed Acyclic Graphs (DAGs), which are often sparse. This ambiguity is evident in the way informal sentences are formulated in various domains. For this reason, focusing on causal orders provides a more practical and direct task for LLMs. We propose a new method for deriving abstractions of causal orders that maximizes a consistency score obtained from an LLM. Our approach begins by computing pairwise consistency scores between variables, from which we construct a semi-complete partially directed graph that consolidates these scores into an abstraction. Using this structure, we identify both a maximally oriented partially directed acyclic graph and an optimal set of acyclic tournaments that maximize consistency across all configurations. We further demonstrate how both the abstraction and the class of causal orders can be used to estimate causal effects. We evaluate our method on a wide set of causal DAGs extracted from scientific literature in epidemiology and public health. Our results show that the proposed approach can effectively recover the correct causal order, providing a reliable and practical LLM-assisted causal framework.

Paper Structure

This paper contains 35 sections, 8 theorems, 7 equations, 17 figures, 11 tables, 1 algorithm.

Key Result

Proposition 1

The maximally consistent semi-complete PDG, $\mathcal{S}$, does not contain directed cycles if the consistency matrix is provided by a consistent knowledge base.

Figures (17)

  • Figure 1: (a) Consistency Matrix. (b) Maximally consistent Semi-Complete PDG. (c) MPDAG derived from the Semi-Complete PDG. (d) Maximally consistent acyclic tournament. In bold edges modified with respect to the Semi-Complete PDG.
  • Figure 2: Non-identifiable subgraphs. In bold, the undirected edges are related to the treatment variable.
  • Figure 3: Boxplot intraclass $\mathcal{D}_{top}$ of MATS using gpt-4.1-nano with different temperatures. Error is bounded to values higher than 0. Note that when the median is not visible signifies it is equal to 0.
  • Figure 4: Prompt used to query the LLM for rephrasing.
  • Figure 5: Prompt used to query the LLM for causal relationships.
  • ...and 12 more figures

Theorems & Definitions (24)

  • Definition 1: Causal Order
  • Definition 2: Acyclic tournament
  • Definition 3: Generalized Backdoor Criterion
  • Definition 4: Consistent Knowledge Base
  • Definition 5: Strictly Consistent Knowledge Base
  • Definition 6: Semi-Complete Partially Directed Graph
  • Proposition 1
  • Proposition 2
  • Proposition 3
  • Proposition 4
  • ...and 14 more