Table of Contents
Fetching ...

On Efficient Adjustment for Micro Causal Effects in Summary Causal Graphs

Isabela Belciug, Simon Ferreira, Charles K. Assaad

TL;DR

The paper addresses identifying micro causal effects from observational data when the underlying causal structure is represented as a summary causal graph (SCG). It provides a simplified, kinship-based reformulation of identifiability conditions and introduces the SCG-back-door criterion to enumerate multiple valid adjustment sets for estimating ${\Pr}(y_t \mid \text{do}(x_{t-\gamma}))$. It additionally defines a quasi-optimal adjustment set, $q\text{-}opt(X_{t-\gamma}, Y_t)_{\mathcal{G}^s}$, that tends to minimize asymptotic estimation variance across compatible full-time DAGs and relates this set to the classical ${\mathbb{A}}^1$ and ${\mathbb{A}}^2$ bounds. The results offer both theoretical advances and practical tools for efficient causal inference in abstracted time-series graphs, with discussion of limitations, sampling considerations, and future extensions (e.g., front-door in SCGs) that broaden applicability to cohort data and imperfect measurements.

Abstract

Observational studies in fields such as epidemiology often rely on covariate adjustment to estimate causal effects. Classical graphical criteria, like the back-door criterion and the generalized adjustment criterion, are powerful tools for identifying valid adjustment sets in directed acyclic graphs (DAGs). However, these criteria are not directly applicable to summary causal graphs (SCGs), which are abstractions of DAGs commonly used in dynamic systems. In SCGs, each node typically represents an entire time series and may involve cycles, making classical criteria inapplicable for identifying causal effects. Recent work established complete conditions for determining whether the micro causal effect of a treatment or an exposure $X_{t-γ}$ on an outcome $Y_t$ is identifiable via covariate adjustment in SCGs, under the assumption of no hidden confounding. However, these identifiability conditions have two main limitations. First, they are complex, relying on cumbersome definitions and requiring the enumeration of multiple paths in the SCG, which can be computationally expensive. Second, when these conditions are satisfied, they only provide two valid adjustment sets, limiting flexibility in practical applications. In this paper, we propose an equivalent but simpler formulation of those identifiability conditions and introduce a new criterion that identifies a broader class of valid adjustment sets in SCGs. Additionally, we characterize the quasi-optimal adjustment set among these, i.e., the one that minimizes the asymptotic variance of the causal effect estimator. Our contributions offer both theoretical advancement and practical tools for more flexible and efficient causal inference in abstracted causal graphs.

On Efficient Adjustment for Micro Causal Effects in Summary Causal Graphs

TL;DR

The paper addresses identifying micro causal effects from observational data when the underlying causal structure is represented as a summary causal graph (SCG). It provides a simplified, kinship-based reformulation of identifiability conditions and introduces the SCG-back-door criterion to enumerate multiple valid adjustment sets for estimating . It additionally defines a quasi-optimal adjustment set, , that tends to minimize asymptotic estimation variance across compatible full-time DAGs and relates this set to the classical and bounds. The results offer both theoretical advances and practical tools for efficient causal inference in abstracted time-series graphs, with discussion of limitations, sampling considerations, and future extensions (e.g., front-door in SCGs) that broaden applicability to cohort data and imperfect measurements.

Abstract

Observational studies in fields such as epidemiology often rely on covariate adjustment to estimate causal effects. Classical graphical criteria, like the back-door criterion and the generalized adjustment criterion, are powerful tools for identifying valid adjustment sets in directed acyclic graphs (DAGs). However, these criteria are not directly applicable to summary causal graphs (SCGs), which are abstractions of DAGs commonly used in dynamic systems. In SCGs, each node typically represents an entire time series and may involve cycles, making classical criteria inapplicable for identifying causal effects. Recent work established complete conditions for determining whether the micro causal effect of a treatment or an exposure on an outcome is identifiable via covariate adjustment in SCGs, under the assumption of no hidden confounding. However, these identifiability conditions have two main limitations. First, they are complex, relying on cumbersome definitions and requiring the enumeration of multiple paths in the SCG, which can be computationally expensive. Second, when these conditions are satisfied, they only provide two valid adjustment sets, limiting flexibility in practical applications. In this paper, we propose an equivalent but simpler formulation of those identifiability conditions and introduce a new criterion that identifies a broader class of valid adjustment sets in SCGs. Additionally, we characterize the quasi-optimal adjustment set among these, i.e., the one that minimizes the asymptotic variance of the causal effect estimator. Our contributions offer both theoretical advancement and practical tools for more flexible and efficient causal inference in abstracted causal graphs.

Paper Structure

This paper contains 8 sections, 14 theorems, 14 equations, 8 figures.

Key Result

Theorem 1

Consider an SCG ${{\mathcal{G}^{s}}}=({{\mathbb{V}^{s}}},{{\mathbb{E}^{s}}})$. The micro causal effect ${\Pr}(y_t \mid \text{do}(x_{t-\gamma}))$ is identifiable if and only if $X \notin {An}(Y,{\mathcal{G}^{s}})$ or $X\in {An}(Y,{\mathcal{G}^{s}})$ and one of the following holds

Figures (8)

  • Figure 1: Example of an SCG and two compatible FT-DAGs where: $X=$ Sleep Efficiency, $Y=$ Stress , $W=$ Total sleep time Jordan_2023.
  • Figure 2: Example of an SCG and two compatible FT-DAGs where: $X=$ Air pollution levels, $Y=$ Respiratory hospital admissions, $W=$ Traffic density Zhang_2013Dominici_2006.
  • Figure 3: Examples of SCGs where ${\Pr}(y_t \mid \text{do}(x_{t-\gamma}))$ is identifiable because Condition \ref{['cond1']} of Theorem \ref{['theorem:identifiability_total_effect_SCG']} is satisfied.
  • Figure 4: Examples of SCGs where ${\Pr}(y_t \mid \text{do}(x_{t}))$ is identifiable because Condition \ref{['cond2']} of Theorem \ref{['theorem:identifiability_total_effect_SCG']} is satisfied.
  • Figure 5: Example of an SCG and one compatible FT-DAG with three adjustment sets that satisfy condition \ref{['cond1']} of the SCG-back-door criteria and one adjustment set that satisfies the back-door criteria in Pearl_1993BackDoor.
  • ...and 3 more figures

Theorems & Definitions (27)

  • Definition 1: Discrete-time dynamic structural causal model (DTDSCM)
  • Definition 2: Full Time Directed Acyclic Graph (FT-DAG)
  • Definition 3: Summary Causal Graph (SCG) with no hidden confounding
  • Theorem 1
  • Definition 4: SCG-back-door criterion for a micro causal effect
  • Theorem 2
  • Definition 5: Optimal adjustment set in an FT-DAG
  • Definition 6: Quasi-optimal adjustment set in an SCG
  • Proposition 1
  • Proposition 2
  • ...and 17 more