Table of Contents
Fetching ...

Towards Bounding Causal Effects under Markov Equivalence

Alexis Bellot

TL;DR

The paper addresses bounding causal effects when only observational data and a Partial Ancestral Graph (PAG) are available, i.e., under Markov equivalence rather than a fully specified diagram. It introduces a PAG-based partial identification framework (Partial IDP) that yields analytic lower and upper bounds on causal effects by exploiting invariances and a decomposition of $Q[\C]$ into $pc$-components and regions. It proves that these bounds can be tighter than Manski's natural bounds and provides algorithms to compute them, along with a discussion of their limitations and scalability. The work also investigates enumeration strategies over Loyal Equivalent Graphs (LEGs) and Maximally Bi-directed Diagrams (MBD), showing expressiveness equivalence with ME MAGs but highlighting substantial computational challenges for practical use in larger systems.

Abstract

Predicting the effect of unseen interventions is a fundamental research question across the data sciences. It is well established that in general such questions cannot be answered definitively from observational data. This realization has fuelled a growing literature introducing various identifying assumptions, for example in the form of a causal diagram among relevant variables. In practice, this paradigm is still too rigid for many practical applications as it is generally not possible to confidently delineate the true causal diagram. In this paper, we consider the derivation of bounds on causal effects given only observational data. We propose to take as input a less informative structure known as a Partial Ancestral Graph, which represents a Markov equivalence class of causal diagrams and is learnable from data. In this more ``data-driven'' setting, we provide a systematic algorithm to derive bounds on causal effects that exploit the invariant properties of the equivalence class, and that can be computed analytically. We demonstrate our method with synthetic and real data examples.

Towards Bounding Causal Effects under Markov Equivalence

TL;DR

The paper addresses bounding causal effects when only observational data and a Partial Ancestral Graph (PAG) are available, i.e., under Markov equivalence rather than a fully specified diagram. It introduces a PAG-based partial identification framework (Partial IDP) that yields analytic lower and upper bounds on causal effects by exploiting invariances and a decomposition of into -components and regions. It proves that these bounds can be tighter than Manski's natural bounds and provides algorithms to compute them, along with a discussion of their limitations and scalability. The work also investigates enumeration strategies over Loyal Equivalent Graphs (LEGs) and Maximally Bi-directed Diagrams (MBD), showing expressiveness equivalence with ME MAGs but highlighting substantial computational challenges for practical use in larger systems.

Abstract

Predicting the effect of unseen interventions is a fundamental research question across the data sciences. It is well established that in general such questions cannot be answered definitively from observational data. This realization has fuelled a growing literature introducing various identifying assumptions, for example in the form of a causal diagram among relevant variables. In practice, this paradigm is still too rigid for many practical applications as it is generally not possible to confidently delineate the true causal diagram. In this paper, we consider the derivation of bounds on causal effects given only observational data. We propose to take as input a less informative structure known as a Partial Ancestral Graph, which represents a Markov equivalence class of causal diagrams and is learnable from data. In this more ``data-driven'' setting, we provide a systematic algorithm to derive bounds on causal effects that exploit the invariant properties of the equivalence class, and that can be computed analytically. We demonstrate our method with synthetic and real data examples.
Paper Structure (14 sections, 15 theorems, 47 equations, 9 figures, 4 algorithms)

This paper contains 14 sections, 15 theorems, 47 equations, 9 figures, 4 algorithms.

Key Result

Proposition 1

Given a PAG $\mathcal{P}$ over $\V$ and $\A\subset\C\subseteq\V$, let the region of $\A$ with respect to $\C$ be denoted $\mathcal{R}_\A$. $Q[\C]$ can be decomposed as,

Figures (9)

  • Figure 1: Examples of diagrams.
  • Figure 2: PAG for \ref{['ex:steps']}.
  • Figure 3: (\ref{['fig:sachs:a']}) Protein signalling network sachs2005causal, (\ref{['fig:sachs:b']}) corresponding PAG for \ref{['ex:sachs']}.
  • Figure 4: Diagrams used in \ref{['sec:enumeration']}.
  • Figure 5: Diagrams used in \ref{['prop:nonredundancy_causal_graphs_in_leg']}.
  • ...and 4 more figures

Theorems & Definitions (44)

  • Definition 1: Causal effect
  • Definition 2: Partial Identification
  • Definition 3
  • Definition 4: $pc$-component jaber2018causal
  • Definition 5: Region jaber2019causal
  • Proposition 1: Thm. 1 jaber2019causal
  • Definition 6: Partial Identification from a PAG
  • Proposition 2
  • Proposition 3: Lower bound
  • Proposition 4: Upper bound
  • ...and 34 more