Inference for Heterogeneous Treatment Effects with Efficient Instruments and Machine Learning
Cyrill Scheidegger, Zijian Guo, Peter Bühlmann
TL;DR
The paper develops a novel instrumental-variable framework for inference on heterogeneous treatment effects under endogeneity, combining double/debiased machine learning with efficient instruments learned from data and kernel smoothing for univariate heterogeneity. It provides consistency and asymptotic normality results, along with robust weak-IV confidence sets, and extends the methodology to homogeneous effects with practical guidance. Through simulations and two real-data applications, the approach demonstrates accuracy and improved coverage under weak instruments when using robust inference, and clarifies situations where learning efficient instruments offers gains. The work delivers an accessible implementation in the R package IVDML, enabling practitioners to estimate and draw inference on β(v) while flexibly accommodating nuisance-function estimation via modern ML tools.
Abstract
We introduce a new instrumental variable (IV) estimator for heterogeneous treatment effects in the presence of endogeneity. Our estimator is based on double/debiased machine learning (DML) and uses efficient machine learning instruments (MLIV) and kernel smoothing. We prove consistency and asymptotic normality of our estimator and also construct confidence sets that are more robust towards weak IV. Along the way, we also provide an accessible discussion of the corresponding estimator for the homogeneous treatment effect with efficient machine learning instruments. The methods are evaluated on synthetic and real datasets and an implementation is made available in the R package IVDML.
