Table of Contents
Fetching ...

Comparing Two Proxy Methods for Causal Identification

Helen Guo, Elizabeth L. Ogburn, Ilya Shpitser

TL;DR

The paper analyzes two nonparametric proxy-based frameworks for causal identification in the presence of unmeasured variables: bridge equations (proximal causal learning) and array decomposition. It clarifies each method's underlying model restrictions, completeness and invertibility requirements, and how they enable identification of interventional distributions or full latent–observed laws. By comparing discrete and continuous formulations and detailing labeling strategies for latent states, the work illuminates when each approach is most appropriate and how the assumptions diverge. The findings emphasize that the two frameworks are non-nested and complementary, guiding researchers in choosing the framework aligned with their proxy structure and identifiability goals. It also points to estimation directions and future work on semiparametric theory for these proximal methods.

Abstract

Identifying causal effects in the presence of unmeasured variables is a fundamental challenge in causal inference, for which proxy variable methods have emerged as a powerful solution. We contrast two major approaches in this framework: (1) bridge equation methods, which leverage solutions to integral equations to recover causal targets, and (2) array decomposition methods, which recover latent factors composing counterfactual quantities by exploiting unique determination of eigenspaces. We compare the model restrictions underlying these two approaches and provide insight into implications of the underlying assumptions, clarifying the scope of applicability for each method.

Comparing Two Proxy Methods for Causal Identification

TL;DR

The paper analyzes two nonparametric proxy-based frameworks for causal identification in the presence of unmeasured variables: bridge equations (proximal causal learning) and array decomposition. It clarifies each method's underlying model restrictions, completeness and invertibility requirements, and how they enable identification of interventional distributions or full latent–observed laws. By comparing discrete and continuous formulations and detailing labeling strategies for latent states, the work illuminates when each approach is most appropriate and how the assumptions diverge. The findings emphasize that the two frameworks are non-nested and complementary, guiding researchers in choosing the framework aligned with their proxy structure and identifiability goals. It also points to estimation directions and future work on semiparametric theory for these proximal methods.

Abstract

Identifying causal effects in the presence of unmeasured variables is a fundamental challenge in causal inference, for which proxy variable methods have emerged as a powerful solution. We contrast two major approaches in this framework: (1) bridge equation methods, which leverage solutions to integral equations to recover causal targets, and (2) array decomposition methods, which recover latent factors composing counterfactual quantities by exploiting unique determination of eigenspaces. We compare the model restrictions underlying these two approaches and provide insight into implications of the underlying assumptions, clarifying the scope of applicability for each method.

Paper Structure

This paper contains 20 sections, 2 theorems, 50 equations, 6 figures, 2 tables.

Key Result

Theorem 3.5

A three-way array $\boldsymbol{\mathcal{T}}$ is essentially unique and has three-way rank R if the following condition holds: where $k_A$, $k_B$, and $k_C$ are the $k$-ranks of $\mathbf{A}$, $\mathbf{B}$, and $\mathbf{C}$, respectively.

Figures (6)

  • Figure 1: Observed confounding; Supports $\mathcal{A}, \mathcal{Y}, \mathcal{C}$
  • Figure 2: Miao et al. (2018); Supports $\mathcal{A}, \mathcal{Y}, \mathcal{U}, \mathcal{W}, \mathcal{Z}$
  • Figure 3: Kuroki and Pearl (2014); Supports $\mathcal{A}, \mathcal{Y}, \mathcal{U}, \mathcal{W}, \mathcal{Z}$
  • Figure 4: Latent Variable with Proxies; Supports $\mathcal{ Y,W,Z, L}$
  • Figure 5: Hidden confounding with observed mediation; Supports $\mathcal{A,M,Y,U}$
  • ...and 1 more figures

Theorems & Definitions (7)

  • Definition 3.1: CP Decomposition
  • Definition 3.2: Essential Uniqueness
  • Definition 3.3: Three-Way Rank
  • Definition 3.4: Kruskal Rank ($k$-rank)
  • Theorem 3.5: Kruskal's Uniqueness Theorem
  • Lemma B.1
  • proof