Table of Contents
Fetching ...

Limits of Approximating the Median Treatment Effect

Raghavendra Addanki, Siddharth Bhandari

TL;DR

This paper investigates heterogeneity-aware causal inference by examining the Median Treatment Effect (MTE) in a finite population with $k$-ary potential outcomes. It proves that MTE is inestimable from data and introduces an $\epsilon$-quantile-approximation based on the rank of the difference vector $\mathbf{a}-\mathbf{b}$, formalizing a key complexity measure called variability. The authors establish instance-optimality results and provide a linear-time Median-Estimator that achieves near-minimal width by computing variability via two small linear programs and greedy routines. The framework makes no distributional assumptions beyond the $k$-ary nature of outcomes and has practical implications for robust policy decisions in the presence of outcome heterogeneity; it also extends to non-median quantiles and potentially continuous outcomes in future work.

Abstract

Average Treatment Effect (ATE) estimation is a well-studied problem in causal inference. However, it does not necessarily capture the heterogeneity in the data, and several approaches have been proposed to tackle the issue, including estimating the Quantile Treatment Effects. In the finite population setting containing $n$ individuals, with treatment and control values denoted by the potential outcome vectors $\mathbf{a}, \mathbf{b}$, much of the prior work focused on estimating median$(\mathbf{a}) -$ median$(\mathbf{b})$, where median($\mathbf x$) denotes the median value in the sorted ordering of all the values in vector $\mathbf x$. It is known that estimating the difference of medians is easier than the desired estimand of median$(\mathbf{a-b})$, called the Median Treatment Effect (MTE). The fundamental problem of causal inference -- for every individual $i$, we can only observe one of the potential outcome values, i.e., either the value $a_i$ or $b_i$, but not both, makes estimating MTE particularly challenging. In this work, we argue that MTE is not estimable and detail a novel notion of approximation that relies on the sorted order of the values in $\mathbf{a-b}$. Next, we identify a quantity called variability that exactly captures the complexity of MTE estimation. By drawing connections to instance-optimality studied in theoretical computer science, we show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability. Finally, we provide a simple linear time algorithm for computing the variability exactly. Unlike much prior work, a particular highlight of our work is that we make no assumptions about how the potential outcome vectors are generated or how they are correlated, except that the potential outcome values are $k$-ary, i.e., take one of $k$ discrete values.

Limits of Approximating the Median Treatment Effect

TL;DR

This paper investigates heterogeneity-aware causal inference by examining the Median Treatment Effect (MTE) in a finite population with -ary potential outcomes. It proves that MTE is inestimable from data and introduces an -quantile-approximation based on the rank of the difference vector , formalizing a key complexity measure called variability. The authors establish instance-optimality results and provide a linear-time Median-Estimator that achieves near-minimal width by computing variability via two small linear programs and greedy routines. The framework makes no distributional assumptions beyond the -ary nature of outcomes and has practical implications for robust policy decisions in the presence of outcome heterogeneity; it also extends to non-median quantiles and potentially continuous outcomes in future work.

Abstract

Average Treatment Effect (ATE) estimation is a well-studied problem in causal inference. However, it does not necessarily capture the heterogeneity in the data, and several approaches have been proposed to tackle the issue, including estimating the Quantile Treatment Effects. In the finite population setting containing individuals, with treatment and control values denoted by the potential outcome vectors , much of the prior work focused on estimating median median, where median() denotes the median value in the sorted ordering of all the values in vector . It is known that estimating the difference of medians is easier than the desired estimand of median, called the Median Treatment Effect (MTE). The fundamental problem of causal inference -- for every individual , we can only observe one of the potential outcome values, i.e., either the value or , but not both, makes estimating MTE particularly challenging. In this work, we argue that MTE is not estimable and detail a novel notion of approximation that relies on the sorted order of the values in . Next, we identify a quantity called variability that exactly captures the complexity of MTE estimation. By drawing connections to instance-optimality studied in theoretical computer science, we show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability. Finally, we provide a simple linear time algorithm for computing the variability exactly. Unlike much prior work, a particular highlight of our work is that we make no assumptions about how the potential outcome vectors are generated or how they are correlated, except that the potential outcome values are -ary, i.e., take one of discrete values.
Paper Structure (14 sections, 11 theorems, 35 equations, 2 figures, 3 algorithms)

This paper contains 14 sections, 11 theorems, 35 equations, 2 figures, 3 algorithms.

Key Result

Theorem 1.1

Let $R_n$ be an RFDT with width $\epsilon^{R_n}(\mathbf{a},\mathbf{b})$ where $(\mathbf{a,b})$ are the treatment and control vectors. If for all $\mathbf{(a,b)}\in \textsc{PO}_k^n \times \textsc{PO}_k^n$, with high probability $R_n$ outputs an median estimate $\widehat{m}$ and width $\epsilon$ such

Figures (2)

  • Figure 1: Linear Program for computing the lower quantile component of variability
  • Figure 2: Linear Program for computing the upper quantile component of variability

Theorems & Definitions (32)

  • Theorem 1.1: Lower Bound on Width of any RFDT (Informal)
  • Theorem 1.2: Tight Bounds for Minimum Median Width (Informal)
  • Theorem 1.3: Median Estimator Algorithm (Informal)
  • Definition 2.1: $k$-ary outcomes
  • Definition 2.4: Quantiles of a vector
  • Definition 2.5: Lower Quantile
  • Definition 2.6: Upper Quantile
  • Proposition 2.7
  • proof
  • Definition 2.8: Variability
  • ...and 22 more