Table of Contents
Fetching ...

Controlling for discrete unmeasured confounding in nonlinear causal models

Patrick Burauel, Frederick Eberhardt, Michel Besserve

TL;DR

The paper tackles unmeasured discrete confounding in nonlinear causal models by reframing the confounded cause–effect system as a latent-variable model with a Gaussian-mixture prior and a piecewise affine mapping to observed data, enabling identifiability of the causal mechanism up to an invertible affine reparameterization. Building on identifiability results for latent variable models with piecewise affine mappings, it shows that the average causal effect E[Y|do(X=x)] can be recovered from observational data despite latent confounding. The authors implement a flow-based model, DeconFlow, that enforces the causal direction via triangular transforms and allows sampling from backdoor latents to perform do-calculus adjustments. Through synthetic experiments in linear and nonlinear settings and a real-world twin-birth dataset, the method demonstrates reduced bias and improved estimation of causal effects when discrete confounding is present. This work bridges causal inference with deep latent-variable identifiability, while acknowledging limitations related to model class assumptions and outlining paths for future extension to richer graphs and other estimands.

Abstract

Unmeasured confounding is a major challenge for identifying causal relationships from non-experimental data. Here, we propose a method that can accommodate unmeasured discrete confounding. Extending recent identifiability results in deep latent variable models, we show theoretically that confounding can be detected and corrected under the assumption that the observed data is a piecewise affine transformation of a latent Gaussian mixture model and that the identity of the mixture components is confounded. We provide a flow-based algorithm to estimate this model and perform deconfounding. Experimental results on synthetic and real-world data provide support for the effectiveness of our approach.

Controlling for discrete unmeasured confounding in nonlinear causal models

TL;DR

The paper tackles unmeasured discrete confounding in nonlinear causal models by reframing the confounded cause–effect system as a latent-variable model with a Gaussian-mixture prior and a piecewise affine mapping to observed data, enabling identifiability of the causal mechanism up to an invertible affine reparameterization. Building on identifiability results for latent variable models with piecewise affine mappings, it shows that the average causal effect E[Y|do(X=x)] can be recovered from observational data despite latent confounding. The authors implement a flow-based model, DeconFlow, that enforces the causal direction via triangular transforms and allows sampling from backdoor latents to perform do-calculus adjustments. Through synthetic experiments in linear and nonlinear settings and a real-world twin-birth dataset, the method demonstrates reduced bias and improved estimation of causal effects when discrete confounding is present. This work bridges causal inference with deep latent-variable identifiability, while acknowledging limitations related to model class assumptions and outlining paths for future extension to richer graphs and other estimands.

Abstract

Unmeasured confounding is a major challenge for identifying causal relationships from non-experimental data. Here, we propose a method that can accommodate unmeasured discrete confounding. Extending recent identifiability results in deep latent variable models, we show theoretically that confounding can be detected and corrected under the assumption that the observed data is a piecewise affine transformation of a latent Gaussian mixture model and that the identity of the mixture components is confounded. We provide a flow-based algorithm to estimate this model and perform deconfounding. Experimental results on synthetic and real-world data provide support for the effectiveness of our approach.
Paper Structure (19 sections, 6 theorems, 37 equations, 6 figures, 1 algorithm)

This paper contains 19 sections, 6 theorems, 37 equations, 6 figures, 1 algorithm.

Key Result

Theorem 3.4

Under Assumptions assum:partinv, assum:phiinject, and assum:nondeg the mixture components and the causal mechanism for the effect $({\bm{Z}}_Y,{\bm{f}}_Y)$ in Eq. eq_model is identifiable up to an invertible affine reparameterization of ${\bm{Z}}_Y$. More precisely, let $(\tilde{{\bm{Z}}}_Y,\tilde{{

Figures (6)

  • Figure 1: On the left, ${\bm{X}}$ causes ${\bm{Y}}$ and is confounded by $H$. On the right, observed variables ${\bm{W}}=({\bm{X}},{\bm{Y}})$ are generated by latent variables ${\bm{Z}}$, whose identifiability up to affine transformation under model restrictions is shown by kivva2022identifiability. We combine knowledge of causal structure with identifiability results for latent variable models to estimate causal effects despite unmeasured confounding (middle).
  • Figure 2: (Flow model implementation) The sequence of transformations that make up one block are composed of an additive coupling bijection from layer $l$ to $l+1$, see lines 5 and 6, a causal transformation with a partly-diagonal structure (${\bm{Z}}_Y$ node does not influence other nodes), see line 7, from $l+1$ to $l+2$, and a permutation layer from $l+2$ to $l+3$. Line numbers refer to Algorithm \ref{['alg_transformation_block']}.
  • Figure 3: With a one-dimensional cause and one-dimensional confounder, $m=n=1$, performance can be evaluated by comparing the $\mathtt{DeconFlow}$-adjusted slope parameter estimates (orange crosses) to the ground truth (green circles). In addition, we report the naive estimates that are obtained without addressing confounding (red triangles).
  • Figure 4: See Section \ref{['sec_results_synthetic']} for description.
  • Figure 5: See Section \ref{['sec_twins']} for description.
  • ...and 1 more figures

Theorems & Definitions (7)

  • Theorem 3.4
  • Proposition 3.4
  • Proposition 3.4
  • Theorem A.1
  • Proposition A.0
  • Proposition A.0
  • Definition B.1: SCM