Hidden yet quantifiable: A lower bound for confounding strength using randomized trials
Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang
TL;DR
This paper tackles unobserved confounding in observational causal analysis by proposing a principled use of randomized trials to detect and quantify confounding strength. It introduces a marginal sensitivity framework with a confounding strength parameter $\Gamma$, a transportability-based setup, and two asymptotically valid tests that leverage either CATE bounds or ATE bounds to decide if confounding is above a user-specified threshold. A practical lower bound $\hat{\Gamma}_{\texttt{LB}}$ is derived, enabling researchers to conclude whether confounding is substantial enough to threaten conclusions. The approach is validated on synthetic, semi-synthetic (real trial supplemented with observational variants), and real WHI/HRT data, demonstrating that the method can distinguish meaningful from negligible confounding and guide corrective actions. Overall, the framework provides a quantitative, testable measure of unobserved confounding that can inform post-marketing surveillance and causal inferences in practice.
Abstract
In the era of fast-paced precision medicine, observational studies play a major role in properly evaluating new treatments in clinical practice. Yet, unobserved confounding can significantly compromise causal conclusions drawn from non-randomized data. We propose a novel strategy that leverages randomized trials to quantify unobserved confounding. First, we design a statistical test to detect unobserved confounding with strength above a given threshold. Then, we use the test to estimate an asymptotically valid lower bound on the unobserved confounding strength. We evaluate the power and validity of our statistical test on several synthetic and semi-synthetic datasets. Further, we show how our lower bound can correctly identify the absence and presence of unobserved confounding in a real-world setting.
