The Role of Measured Covariates in Assessing Sensitivity to Unmeasured Confounding
Abhinandan Dalal, Iris Horng, Yang Feng, Dylan S. Small
TL;DR
The paper tackles how the structure of measured covariates shapes sensitivity to unmeasured confounding in observational causal inference. It derives a bias amplification result in linear regression, showing that the impact of unmeasured confounding on the estimated effect is magnified when a proxy $X$ for a latent factor $U$ is strongly associated with the exposure $A$ and measured with error, formalized through the observable ratio $\beta_{A\sim X}/(\mathrm{Var}(A)(1-R^2_{A\sim X}))$. The authors apply the framework to smoking and lung cancer using NHANES data from 1971–74 and 2015–16, revealing that evolving socioeconomic patterns have increased the exposure–covariate coupling and thus the sensitivity to residual confounding. They propose a practical diagnostic that combines the observed exposure–covariate association with the residual variance to gauge robustness of causal conclusions in proxy-adjusted analyses. Overall, the work provides a principled way to quantify and monitor how multicollinearity and proxy quality affect the credibility of causal claims in observational studies.
Abstract
Sensitivity analysis is widely used to assess the robustness of causal conclusions in observational studies, yet its interaction with the structure of measured covariates is often overlooked. When latent confounders cannot be directly adjusted for and are instead controlled using proxy variables, strong associations between exposure and measured proxies can amplify sensitivity to residual confounding. We formalize this phenomenon in linear regression settings by showing that a simple ratio involving the exposure model coefficient and residual exposure variance provides an observable measure of this increased sensitivity. Applying our framework to smoking and lung cancer, we document how growing socioeconomic stratification in smoking behavior over time leads to heightened sensitivity to unmeasured confounding in more recent data. These results highlight the importance of multicollinearity when interpreting sensitivity analyses based on proxy adjustment.
