Table of Contents
Fetching ...

Causal computations in Semi Markovian Structural Causal Models using divide and conquer

Anna Rodum Bjøru, Rafael Cabañas, Helge Langseth, Antonio Salmerón

TL;DR

The paper tackles bounding counterfactuals in semi-Markovian SCMs when exogenous marginals are incompletely known. It extends the Divide and Conquer for Causal Computation (DCCC) framework to handle exogenous confounding, deriving exact extensions for specific semi-Markovian topologies and two Markovian approximations (endogenous merge and exogenous split). By incorporating observational and interventional data, the authors show tighter bounds and, in some cases, identifiability of interventional queries, while demonstrating substantial computational advantages over existing methods like EMCC and Gibbs sampling. The results indicate that combining observational and experimental data yields the most informative bounds, and the proposed approximations offer a favorable trade-off between accuracy and efficiency. The work lays groundwork for generalizing DCCC to broader semi-Markovian structures and provides practical guidelines for translating semi-Markovian problems into solvable Markovian equivalents.

Abstract

Recently, Bjøru et al. proposed a novel divide-and-conquer algorithm for bounding counterfactual probabilities in structural causal models (SCMs). They assumed that the SCMs were learned from purely observational data, leading to an imprecise characterization of the marginal distributions of exogenous variables. Their method leveraged the canonical representation of structural equations to decompose a general SCM with high-cardinality exogenous variables into a set of sub-models with low-cardinality exogenous variables. These sub-models had precise marginals over the exogenous variables and therefore admitted efficient exact inference. The aggregated results were used to bound counterfactual probabilities in the original model. The approach was developed for Markovian models, where each exogenous variable affects only a single endogenous variable. In this paper, we investigate extending the methodology to \textit{semi-Markovian} SCMs, where exogenous variables may influence multiple endogenous variables. Such models are capable of representing confounding relationships that Markovian models cannot. We illustrate the challenges of this extension using a minimal example, which motivates a set of alternative solution strategies. These strategies are evaluated both theoretically and through a computational study.

Causal computations in Semi Markovian Structural Causal Models using divide and conquer

TL;DR

The paper tackles bounding counterfactuals in semi-Markovian SCMs when exogenous marginals are incompletely known. It extends the Divide and Conquer for Causal Computation (DCCC) framework to handle exogenous confounding, deriving exact extensions for specific semi-Markovian topologies and two Markovian approximations (endogenous merge and exogenous split). By incorporating observational and interventional data, the authors show tighter bounds and, in some cases, identifiability of interventional queries, while demonstrating substantial computational advantages over existing methods like EMCC and Gibbs sampling. The results indicate that combining observational and experimental data yields the most informative bounds, and the proposed approximations offer a favorable trade-off between accuracy and efficiency. The work lays groundwork for generalizing DCCC to broader semi-Markovian structures and provides practical guidelines for translating semi-Markovian problems into solvable Markovian equivalents.

Abstract

Recently, Bjøru et al. proposed a novel divide-and-conquer algorithm for bounding counterfactual probabilities in structural causal models (SCMs). They assumed that the SCMs were learned from purely observational data, leading to an imprecise characterization of the marginal distributions of exogenous variables. Their method leveraged the canonical representation of structural equations to decompose a general SCM with high-cardinality exogenous variables into a set of sub-models with low-cardinality exogenous variables. These sub-models had precise marginals over the exogenous variables and therefore admitted efficient exact inference. The aggregated results were used to bound counterfactual probabilities in the original model. The approach was developed for Markovian models, where each exogenous variable affects only a single endogenous variable. In this paper, we investigate extending the methodology to \textit{semi-Markovian} SCMs, where exogenous variables may influence multiple endogenous variables. Such models are capable of representing confounding relationships that Markovian models cannot. We illustrate the challenges of this extension using a minimal example, which motivates a set of alternative solution strategies. These strategies are evaluated both theoretically and through a computational study.

Paper Structure

This paper contains 24 sections, 19 equations, 7 figures, 9 tables.

Figures (7)

  • Figure 1: Elements of an SCM: (left) causal graph, (center) structural equations and (right) empirical distribution computed from the data.
  • Figure 2: Graph of the example Markovian model.
  • Figure 3: Graph of the example model for semi Markovian discussion.
  • Figure 4: Markovian approximation of Figure \ref{['fig:modelexample']}, replacing variables $Y_1, Y_2$ that share exogenous parent $U$ by new variable $Y = (Y_1, Y_2)$ with domain $\Omega_{Y} = \{00, 01, 10, 11\}$ and new exogenous parent $U^*$.
  • Figure 5: The figure shows boxplots of the root mean squared error (RMSE) calculated for intervals computed with S-O and MM-O in blue, and S-O and MS-O in red.
  • ...and 2 more figures

Theorems & Definitions (2)

  • Definition 1: Reduction operator
  • Example 1