A Convex Framework for Confounding Robust Inference
Kei Ishikawa, Niao He, Takafumi Kanamori
TL;DR
This work addresses robust policy evaluation for offline contextual bandits under unobserved confounding by proposing a convex framework that yields a sharp lower bound on policy value. The core idea is to reformulate the infinite-dimensional conditional moment constraints as tractable convex problems via weight reparameterization and a kernel-based low-rank approximation (KCMC), with strong duality allowing an empirical risk minimization interpretation. The framework supports extensions to f-divergence-based uncertainty, model selection via cross-validation or information criteria, and robust policy learning, while guaranteeing consistency and asymptotic normality of both evaluation and learning procedures. Empirical results demonstrate tighter bounds and effective policy learning across discrete and continuous actions, validating the practical impact of this convex, kernel-based approach for confounding-robust inference.
Abstract
We study policy evaluation of offline contextual bandits subject to unobserved confounders. Sensitivity analysis methods are commonly used to estimate the policy value under the worst-case confounding over a given uncertainty set. However, existing work often resorts to some coarse relaxation of the uncertainty set for the sake of tractability, leading to overly conservative estimation of the policy value. In this paper, we propose a general estimator that provides a sharp lower bound of the policy value using convex programming. The generality of our estimator enables various extensions such as sensitivity analysis with f-divergence, model selection with cross validation and information criterion, and robust policy learning with the sharp lower bound. Furthermore, our estimation method can be reformulated as an empirical risk minimization problem thanks to the strong duality, which enables us to provide strong theoretical guarantees of the proposed estimator using techniques of the M-estimation.
