Stochastic Neural Networks for Causal Inference with Missing Confounders

Yaxin Fang; Faming Liang

Stochastic Neural Networks for Causal Inference with Missing Confounders

Yaxin Fang, Faming Liang

TL;DR

Confounder Imputation with Stochastic Neural Networks (CI-StoNet), which parameterizes the conditional structure of a causal directed acyclic graph using a stochastic neural network and imputes latent confounders via adaptive stochastic-gradient Hamiltonian Monte Carlo and the effect of overlap on estimation accuracy is characterized.

Abstract

Unmeasured confounding is a fundamental obstacle to causal inference from observational data. Latent-variable methods address this challenge by imputing unobserved confounders, yet many lack explicit model-based identification guarantees and are difficult to extend to richer causal structures. We propose Confounder Imputation with Stochastic Neural Networks (CI-StoNet), which parameterizes the conditional structure of a causal directed acyclic graph using a stochastic neural network and imputes latent confounders via adaptive stochastic-gradient Hamiltonian Monte Carlo. Under SUTVA and overlap, and assuming that the structural components of the data-generating process are well approximated by a capacity-controlled sparse deep neural network class, we establish model identification and consistent estimation of the mean potential outcome under a fixed intervention within this class. Although the latent confounder is identifiable only up to reparameterizations that preserve the joint treatment-outcome distribution, the causal estimand is invariant across this observationally equivalent class. We further characterize the effect of overlap on estimation accuracy. Empirical results on simulated and benchmark datasets demonstrate accurate performance, and the framework extends naturally to proxy-variable and multiple-cause settings with overlap diagnostics and bootstrap-based uncertainty quantification.

Stochastic Neural Networks for Causal Inference with Missing Confounders

TL;DR

Abstract

Paper Structure (50 sections, 12 theorems, 114 equations, 7 figures, 5 tables, 1 algorithm)

This paper contains 50 sections, 12 theorems, 114 equations, 7 figures, 5 tables, 1 algorithm.

Introduction
CI-StoNet for Missing Confounders
The CI-StoNet Approach
Causal effect estimation.
Theoretical Guarantees
Model-based Identifiability
Theoretical Analysis
A Simulation Study
Causal StoNet for Proxy Variables
Numerical Experiments
Simulated Examples
Benchmark Datasets
Conclusion
Appendix
Supplementary Examples
...and 35 more sections

Key Result

Theorem 1

(Estimation error)Suppose Assumptions ass:0-ass:1 and the conditions in Lemma thm:lem1 and Theorem thm:lem2 (stated in Supplement sect:proof) hold. Then

Figures (7)

Figure 1: simple confounding
Figure 2: Diagram of CI-StoNet under simple confounding, where white rectangles represent variables from observed data; light-grey rounded-rectangles represent latent variable to impute; and dark-grey rectangles represent neural network modules to learn respective conditional distributions.
Figure 3: Other examples of causal structures: (a) existence of colliders, represented by $C$ ; (b) existence of mediators, represented by $M$.
Figure 4: (a) Causal DAG: without dependence on the proxy; (b) Diagram of CI-StoNet under the proxy setting: white rectangles represent variables from observed data; light-grey rounded-rectangles represent hidden neurons; dark-grey rectangles represent network modules to learn respective conditional distributions.
Figure S1: CI-StoNet results for the simulation study. Points denote the true marginal effects and error bars represent one standard error of the marginal effect estimator.
...and 2 more figures

Theorems & Definitions (26)

Theorem 1
Remark 1
Theorem 2
Lemma A1
proof
Remark A1
Lemma A2
Lemma A3
Lemma A4
proof
...and 16 more

Stochastic Neural Networks for Causal Inference with Missing Confounders

TL;DR

Abstract

Stochastic Neural Networks for Causal Inference with Missing Confounders

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (26)