GAAVI: Global Asymptotic Anytime Valid Inference for the Conditional Mean Function
Brian M Cho, Raaz Dwivedi, Nathan Kallus
TL;DR
GAAVI develops nonparametric, asymptotic anytime-valid tests for global null hypotheses on the conditional mean function (CMF) and CATE under general covariate spaces. It constructs a weighted, Gaussian-mixture martingale with a time-uniform lower bound to enable early, error-controlled rejection and inverts these tests to form function-valued confidence sequences. Theoretical guarantees include asymptotic AV type I error control, power-one under mild assumptions, and asymptotically optimal sample complexities relative to Gaussian shift models, even with nuisance estimators that do not converge. Empirical results on synthetic data and a real-world randomized trial demonstrate robust error control and competitive power under continuous monitoring, outperforming discretized or linear-projection baselines in nonlinear settings. The framework enables timely, high-confidence decisions in online experiments, algorithmic auditing, and fairness assessments while accommodating flexible, model-agnostic regression methods.
Abstract
Inference on the conditional mean function (CMF) is central to tasks from adaptive experimentation to optimal treatment assignment and algorithmic fairness auditing. In this work, we provide a novel asymptotic anytime-valid test for a CMF global null (e.g., that all conditional means are zero) and contrasts between CMFs, enabling experimenters to make high confidence decisions at any time during the experiment beyond a minimum sample size. We provide mild conditions under which our tests achieve (i) asymptotic type-I error guarantees, (i) power one, and, unlike past tests, (iii) optimal sample complexity relative to a Gaussian location testing. By inverting our tests, we show how to construct function-valued asymptotic confidence sequences for the CMF and contrasts thereof. Experiments on both synthetic and real-world data show our method is well-powered across various distributions while preserving the nominal error rate under continuous monitoring.
