Learning Representations of Instruments for Partial Identification of Treatment Effects

Jonas Schweisthal; Dennis Frauen; Maresa Schröder; Konstantin Hess; Niki Kilbertus; Stefan Feuerriegel

Learning Representations of Instruments for Partial Identification of Treatment Effects

Jonas Schweisthal, Dennis Frauen, Maresa Schröder, Konstantin Hess, Niki Kilbertus, Stefan Feuerriegel

TL;DR

This paper tackles estimating CATE from observational data when unconfoundedness fails by leveraging complex instrumental variables to obtain partial identification bounds. It introduces a novel framework that maps high-dimensional Z to a discrete representation phi(Z) and derives valid, closed-form population bounds on the CATE, b^−(x) and b^+(x), which are tightened by optimally selecting phi. A two-stage neural approach learns tight, variance-conscious bounds: first estimating nuisance functions, then learning the discrete latent phi(Z) via a Gumbel-softmax discretization and a loss that balances bound width against estimation stability. The method is theoretically justified and empirically validated on Mendelian randomization-like simulations, demonstrating 100% coverage and tighter bounds than naive discretization, with robustness to the number of partitions. Overall, this work provides a practical, non-parametric path to using complex IVs (including genetic data, text, images) for reliable causal decision-making under partial identification.

Abstract

Reliable estimation of treatment effects from observational data is important in many disciplines such as medicine. However, estimation is challenging when unconfoundedness as a standard assumption in the causal inference literature is violated. In this work, we leverage arbitrary (potentially high-dimensional) instruments to estimate bounds on the conditional average treatment effect (CATE). Our contributions are three-fold: (1) We propose a novel approach for partial identification through a mapping of instruments to a discrete representation space so that we yield valid bounds on the CATE. This is crucial for reliable decision-making in real-world applications. (2) We derive a two-step procedure that learns tight bounds using a tailored neural partitioning of the latent instrument space. As a result, we avoid instability issues due to numerical approximations or adversarial training. Furthermore, our procedure aims to reduce the estimation variance in finite-sample settings to yield more reliable estimates. (3) We show theoretically that our procedure obtains valid bounds while reducing estimation variance. We further perform extensive experiments to demonstrate the effectiveness across various settings. Overall, our procedure offers a novel path for practitioners to make use of potentially high-dimensional instruments (e.g., as in Mendelian randomization).

Learning Representations of Instruments for Partial Identification of Treatment Effects

TL;DR

Abstract

Learning Representations of Instruments for Partial Identification of Treatment Effects

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (10)