Limits of Approximating the Median Treatment Effect

Raghavendra Addanki; Siddharth Bhandari

Limits of Approximating the Median Treatment Effect

Raghavendra Addanki, Siddharth Bhandari

TL;DR

This paper investigates heterogeneity-aware causal inference by examining the Median Treatment Effect (MTE) in a finite population with $k$-ary potential outcomes. It proves that MTE is inestimable from data and introduces an $\epsilon$-quantile-approximation based on the rank of the difference vector $\mathbf{a}-\mathbf{b}$, formalizing a key complexity measure called variability. The authors establish instance-optimality results and provide a linear-time Median-Estimator that achieves near-minimal width by computing variability via two small linear programs and greedy routines. The framework makes no distributional assumptions beyond the $k$-ary nature of outcomes and has practical implications for robust policy decisions in the presence of outcome heterogeneity; it also extends to non-median quantiles and potentially continuous outcomes in future work.

Abstract

Average Treatment Effect (ATE) estimation is a well-studied problem in causal inference. However, it does not necessarily capture the heterogeneity in the data, and several approaches have been proposed to tackle the issue, including estimating the Quantile Treatment Effects. In the finite population setting containing $n$ individuals, with treatment and control values denoted by the potential outcome vectors $\mathbf{a}, \mathbf{b}$, much of the prior work focused on estimating median$(\mathbf{a}) -$ median$(\mathbf{b})$, where median($\mathbf x$) denotes the median value in the sorted ordering of all the values in vector $\mathbf x$. It is known that estimating the difference of medians is easier than the desired estimand of median$(\mathbf{a-b})$, called the Median Treatment Effect (MTE). The fundamental problem of causal inference -- for every individual $i$, we can only observe one of the potential outcome values, i.e., either the value $a_i$ or $b_i$, but not both, makes estimating MTE particularly challenging. In this work, we argue that MTE is not estimable and detail a novel notion of approximation that relies on the sorted order of the values in $\mathbf{a-b}$. Next, we identify a quantity called variability that exactly captures the complexity of MTE estimation. By drawing connections to instance-optimality studied in theoretical computer science, we show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability. Finally, we provide a simple linear time algorithm for computing the variability exactly. Unlike much prior work, a particular highlight of our work is that we make no assumptions about how the potential outcome vectors are generated or how they are correlated, except that the potential outcome values are $k$-ary, i.e., take one of $k$ discrete values.

Limits of Approximating the Median Treatment Effect

TL;DR

This paper investigates heterogeneity-aware causal inference by examining the Median Treatment Effect (MTE) in a finite population with

-ary potential outcomes. It proves that MTE is inestimable from data and introduces an

-quantile-approximation based on the rank of the difference vector

, formalizing a key complexity measure called variability. The authors establish instance-optimality results and provide a linear-time Median-Estimator that achieves near-minimal width by computing variability via two small linear programs and greedy routines. The framework makes no distributional assumptions beyond the

-ary nature of outcomes and has practical implications for robust policy decisions in the presence of outcome heterogeneity; it also extends to non-median quantiles and potentially continuous outcomes in future work.

Abstract

individuals, with treatment and control values denoted by the potential outcome vectors

, much of the prior work focused on estimating median

median

, where median(

) denotes the median value in the sorted ordering of all the values in vector

. It is known that estimating the difference of medians is easier than the desired estimand of median

, called the Median Treatment Effect (MTE). The fundamental problem of causal inference -- for every individual

, we can only observe one of the potential outcome values, i.e., either the value

, but not both, makes estimating MTE particularly challenging. In this work, we argue that MTE is not estimable and detail a novel notion of approximation that relies on the sorted order of the values in

. Next, we identify a quantity called variability that exactly captures the complexity of MTE estimation. By drawing connections to instance-optimality studied in theoretical computer science, we show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability. Finally, we provide a simple linear time algorithm for computing the variability exactly. Unlike much prior work, a particular highlight of our work is that we make no assumptions about how the potential outcome vectors are generated or how they are correlated, except that the potential outcome values are

-ary, i.e., take one of

discrete values.

Paper Structure (14 sections, 11 theorems, 35 equations, 2 figures, 3 algorithms)

This paper contains 14 sections, 11 theorems, 35 equations, 2 figures, 3 algorithms.

Introduction
Technical Overview
Is Median Estimable?
Approximating the Median
Optimality of Median Approximation
Our Contributions
Preliminaries and Problem Formulation
Formalizing the Algorithm
Variability & Minimum Median Width
Bounds on Variability
Algorithm
Approximating the Median
Computing Variability
Conclusion

Key Result

Theorem 1.1

Let $R_n$ be an RFDT with width $\epsilon^{R_n}(\mathbf{a},\mathbf{b})$ where $(\mathbf{a,b})$ are the treatment and control vectors. If for all $\mathbf{(a,b)}\in \textsc{PO}_k^n \times \textsc{PO}_k^n$, with high probability $R_n$ outputs an median estimate $\widehat{m}$ and width $\epsilon$ such

Figures (2)

Figure 1: Linear Program for computing the lower quantile component of variability
Figure 2: Linear Program for computing the upper quantile component of variability

Theorems & Definitions (32)

Theorem 1.1: Lower Bound on Width of any RFDT (Informal)
Theorem 1.2: Tight Bounds for Minimum Median Width (Informal)
Theorem 1.3: Median Estimator Algorithm (Informal)
Definition 2.1: $k$-ary outcomes
Definition 2.4: Quantiles of a vector
Definition 2.5: Lower Quantile
Definition 2.6: Upper Quantile
Proposition 2.7
proof
Definition 2.8: Variability
...and 22 more

Limits of Approximating the Median Treatment Effect

TL;DR

Abstract

Limits of Approximating the Median Treatment Effect

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (32)