Causal Effect Estimation with Learned Instrument Representations
Frances Dean, Jenna Fields, Radhika Bhalerao, Marie Charpignon, Ahmed Alaa
TL;DR
This work tackles unobserved confounding in observational causal inference by learning instrumental representations directly from data. It introduces ZNet, an encoder-based model that decomposes covariates into a confounding component and a learned instrument, enforcing IV moment conditions to produce valid representations for downstream two-stage IV estimators. Through both semi-synthetic and unstructured data experiments, ZNet recovers existing instruments when present and demonstrates strong performance with latent instruments, outperforming TARNet and other IV-generation baselines in estimating ATE and CATE. The approach broadens the applicability of IV methods to high-dimensional and non-tabular data, offering a practical plug-in for causal inference where explicit instruments are unavailable or uncertain.
Abstract
Instrumental variable (IV) methods mitigate bias from unobserved confounding in observational causal inference but rely on the availability of a valid instrument, which can often be difficult or infeasible to identify in practice. In this paper, we propose a representation learning approach that constructs instrumental representations from observed covariates, which enable IV-based estimation even in the absence of an explicit instrument. Our model (ZNet) achieves this through an architecture that mirrors the structural causal model of IVs; it decomposes the ambient feature space into confounding and instrumental components, and is trained by enforcing empirical moment conditions corresponding to the defining properties of valid instruments (i.e., relevance, exclusion restriction, and instrumental unconfoundedness). Importantly, ZNet is compatible with a wide range of downstream two-stage IV estimators of causal effects. Our experiments demonstrate that ZNet can (i) recover ground-truth instruments when they already exist in the ambient feature space and (ii) construct latent instruments in the embedding space when no explicit IVs are available. This suggests that ZNet can be used as a ``plug-and-play'' module for causal inference in general observational settings, regardless of whether the (untestable) assumption of unconfoundedness is satisfied.
