Prediction-Powered E-Values
Daniel Csillag, Claudio José Struchiner, Guilherme Tegoni Goedert
TL;DR
The paper develops prediction-powered inference by embedding predictive imputations into e-values, preserving anytime-validity and post-hoc guarantees while expanding inference beyond Z-estimation. By debiasing imputed terms with Bernoulli sampling and a calibrated epsilon, the framework yields prediction-powered e-values that apply to hypothesis testing, confidence sequences, and complex tasks like change-point detection and causal discovery with costly data. The approach demonstrates significant data-efficiency gains across four real-world-inspired case studies, producing tighter valid intervals and earlier detections than conventional baselines. Practically, this modular method can be integrated into existing algorithms to exploit cheap data streams while maintaining rigorous statistical guarantees.
Abstract
Quality statistical inference requires a sufficient amount of data, which can be missing or hard to obtain. To this end, prediction-powered inference has risen as a promising methodology, but existing approaches are largely limited to Z-estimation problems such as inference of means and quantiles. In this paper, we apply ideas of prediction-powered inference to e-values. By doing so, we inherit all the usual benefits of e-values -- such as anytime-validity, post-hoc validity and versatile sequential inference -- as well as greatly expand the set of inferences achievable in a prediction-powered manner. In particular, we show that every inference procedure that can be framed in terms of e-values has a prediction-powered counterpart, given by our method. We showcase the effectiveness of our framework across a wide range of inference tasks, from simple hypothesis testing and confidence intervals to more involved procedures for change-point detection and causal discovery, which were out of reach of previous techniques. Our approach is modular and easily integrable into existing algorithms, making it a compelling choice for practical applications.
