Auditing Fairness under Unobserved Confounding
Yewon Byun, Dylan Sam, Michael Oberst, Zachary C. Lipton, Bryan Wilder
TL;DR
This work tackles the challenge of auditing fairness when unobserved confounding could influence both resource allocation and outcomes. It introduces a partial-identification framework that bounds the treatment rate among the needy, $P(T=1|Y(0)=1, D=1)$, without requiring complete identifiability, by leveraging pre-availability data to estimate baseline risk and covariate information to tighten bounds. The authors develop bias-corrected, cross-fitted estimators for the bounds, establish asymptotic normality under a margin condition, and extend the approach with a bounded confounding sensitivity model parameterized by $\gamma$. They validate the framework on Paxlovid allocation data and semi-synthetic/synthetic settings, showing meaningful inequities that persist under plausible unobserved confounding and demonstrating improved accuracy over plug-in methods. The work provides a principled tool for auditing fairness in real-world decision systems where unobserved risk factors are likely present, with direct implications for policy and resource distribution.
Abstract
Many definitions of fairness or inequity involve unobservable causal quantities that cannot be directly estimated without strong assumptions. For instance, it is particularly difficult to estimate notions of fairness that rely on hard-to-measure concepts such as risk (e.g., quantifying whether patients at the same risk level have equal probability of treatment, regardless of group membership). Such measurements of risk can be accurately obtained when no unobserved confounders have jointly influenced past decisions and outcomes. However, in the real world, this assumption rarely holds. In this paper, we show that, surprisingly, one can still compute meaningful bounds on treatment rates for high-risk individuals (i.e., conditional on their true, \textit{unobserved} negative outcome), even when entirely eliminating or relaxing the assumption that we observe all relevant risk factors used by decision makers. We use the fact that in many real-world settings (e.g., the release of a new treatment) we have data from prior to any allocation to derive unbiased estimates of risk. This result enables us to audit unfair outcomes of existing decision-making systems in a principled manner. We demonstrate the effectiveness of our framework with a real-world study of Paxlovid allocation, provably identifying that observed racial inequity cannot be explained by unobserved confounders of the same strength as important observed covariates.
