The Categorical Instrumental Variable Model: Characterization, Partial Identification, and Statistical Inference
Yilin Song, F. Richard Guo, K. C. Gary Chan, Thomas S. Richardson
TL;DR
This work analyzes partial identification for categorical instrumental variable models where $Z$, $X$, and $Y$ are finite-valued. It derives a simple closed-form characterization of the joint counterfactual distribution $P'(Y(x_1),\dots,Y(x_K))$ via linear inequalities that link counterfactuals to the observed $P(X,Y\mid Z)$, and shows these are necessary, sufficient, and non-redundant across five IV models defined by exclusion and independence variants. The sufficiency proof uses Strassen's theorem to construct couplings, yielding a polyhedral description that supports sharp bounds on linear functionals such as the average treatment effect and enables a falsification test. For inference, the paper develops a conservative finite-sample CI framework based on a KL-divergence tail bound, implemented through convex programming, and demonstrates practical utility with the Minneapolis Domestic Violence Experiment data, where multi-arm IV analysis is essential.
Abstract
We study categorical instrumental variable (IV) models with instrument, treatment, and outcome taking finitely many values. We derive a simple closed-form characterization of the set of joint distributions of potential outcomes that are compatible with a given observed data distribution in terms of a set of inequalities. These inequalities unify several different IV models defined by versions of the independence and exclusion restriction assumptions and are shown to be non-redundant. Finally, given a set of linear functionals of the joint counterfactual distribution, such as pairwise average treatment effects, we construct confidence intervals with simultaneous finite-sample coverage, using a tail bound on the Kullback--Leibler divergence. We illustrate our method using data from the Minneapolis Domestic Violence Experiment.
