Table of Contents
Fetching ...

Spectral Representation for Causal Estimation with Hidden Confounders

Haotian Sun, Antoine Moulin, Tongzheng Ren, Arthur Gretton, Bo Dai

TL;DR

This work tackles causal effect estimation in the presence of hidden confounders, focusing on IV regression with observed confounders and Proxy Causal Learning (PCL). It introduces a spectral representation built on a low-rank factorization of the conditional expectation operator and trains via a saddle-point objective to learn the causal function and its dual, enabling efficient representation learning and avoiding double-sampling bias. The authors derive explicit function classes for IV, IV-OC, and PCL, propose contrastive learning to discover the spectral bases, and develop practical algorithms (SpecIV and SpecPCL) with finite-dimensional representations. Empirical results on dSprites and Demand Design show that SpecIV/SpecPCL achieve state-of-the-art accuracy and efficiency across IV and PCL benchmarks, validating the approach's scalability and robustness.

Abstract

We address the problem of causal effect estimation where hidden confounders are present, with a focus on two settings: instrumental variable regression with additional observed confounders, and proxy causal learning. Our approach uses a singular value decomposition of a conditional expectation operator, followed by a saddle-point optimization problem, which, in the context of IV regression, can be thought of as a neural net generalization of the seminal approach due to Darolles et al. [2011]. Saddle-point formulations have gathered considerable attention recently, as they can avoid double sampling bias and are amenable to modern function approximation methods. We provide experimental validation in various settings, and show that our approach outperforms existing methods on common benchmarks.

Spectral Representation for Causal Estimation with Hidden Confounders

TL;DR

This work tackles causal effect estimation in the presence of hidden confounders, focusing on IV regression with observed confounders and Proxy Causal Learning (PCL). It introduces a spectral representation built on a low-rank factorization of the conditional expectation operator and trains via a saddle-point objective to learn the causal function and its dual, enabling efficient representation learning and avoiding double-sampling bias. The authors derive explicit function classes for IV, IV-OC, and PCL, propose contrastive learning to discover the spectral bases, and develop practical algorithms (SpecIV and SpecPCL) with finite-dimensional representations. Empirical results on dSprites and Demand Design show that SpecIV/SpecPCL achieve state-of-the-art accuracy and efficiency across IV and PCL benchmarks, validating the approach's scalability and robustness.

Abstract

We address the problem of causal effect estimation where hidden confounders are present, with a focus on two settings: instrumental variable regression with additional observed confounders, and proxy causal learning. Our approach uses a singular value decomposition of a conditional expectation operator, followed by a saddle-point optimization problem, which, in the context of IV regression, can be thought of as a neural net generalization of the seminal approach due to Darolles et al. [2011]. Saddle-point formulations have gathered considerable attention recently, as they can avoid double sampling bias and are amenable to modern function approximation methods. We provide experimental validation in various settings, and show that our approach outperforms existing methods on common benchmarks.
Paper Structure (50 sections, 12 theorems, 54 equations, 4 figures, 6 tables, 2 algorithms)

This paper contains 50 sections, 12 theorems, 54 equations, 4 figures, 6 tables, 2 algorithms.

Key Result

Proposition 0

If Assumption asp:low-rank-iv holds, then for any function $f \in L_2 \left( \mathbb{P}_X \right)$, there exists a vector $v_f \in \mathbb{R}^d$ such that $E f = \left\langle \psi \left( Z \right), v_f \right\rangle$.

Figures (4)

  • Figure 1: Causal graphs for the three settings: (a) IV regression, (b) IV regression with observed confounder ($O$), and (c) proxy causal inference. Gray nodes represent observed variables; white nodes are unobserved.
  • Figure 2: MSE and Runtime on dSprites Dataset with (a) Low-Dimensional Instruments (32) and (b) High-Dimensional Instruments (64), and Demand Design dataset with (c) 5,000 and (d) 10,000 training samples. Methods employing pre-specified features, i.e., DE and KIV, are represented as dashed baselines.
  • Figure 3: MSE and Runtime of structural function estimation on the dSprites dataset with 5,000 training samples.
  • Figure 4: MSE comparison of (a) SpecIV and DFIV on dSprites dataset with 5,000 labeled training samples and (b) SpecIV on Demand Design Dataset with 5,000 and 10,000 labeled samples, respectively. Each method is trained with the specified amount of labeled data and an additional set of unlabeled samples.

Theorems & Definitions (12)

  • Proposition 0
  • Proposition 0: Dual class for IV regression
  • Proposition 0: Primal space for IV
  • Proposition 0
  • Proposition 0: Primal space for IV-OC
  • Proposition 0: Dual space for IV-OC
  • Proposition 0
  • Proposition 0: Dual class for IV regression
  • Proposition 0: Primal space for IV
  • Proposition 0
  • ...and 2 more