Multiple-Prediction-Powered Inference

Charlie Cowen-Breen, Alekh Agarwal, Stephen Bates, William W. Cohen, Jacob Eisenstein, Amir Globerson, Adam Fisch

Abstract

Statistical estimation often involves tradeoffs between expensive, high-quality measurements and a variety of lower-quality proxies. We introduce Multiple-Prediction-Powered Inference (MultiPPI): a general framework for constructing statistically efficient estimates by optimally allocating resources across these diverse data sources. This work provides theoretical guarantees about the minimax optimality, finite-sample performance, and asymptotic normality of the MultiPPI estimator. Through experiments across three diverse large language model (LLM) evaluation scenarios, we show that MultiPPI consistently achieves lower estimation error than existing baselines. This advantage stems from its budget-adaptive allocation strategy, which strategically combines subsets of models by learning their complex cost and correlation structures.
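To make the framework concrete, the sketch below shows the classical single-predictor prediction-powered inference (PPI) mean estimator that MultiPPI generalizes: a cheap model's predictions on a large unlabeled set are debiased using a small set of expensive gold labels. This is a minimal illustration of the general idea, not the paper's MultiPPI estimator; the function name, the toy data, and the fixed tuning parameter `lam` are all illustrative assumptions.

```python
import numpy as np

def ppi_mean(y_labeled, f_labeled, f_unlabeled, lam=1.0):
    """Prediction-powered estimate of E[Y].

    Combines model predictions f on a large unlabeled set with a small
    labeled set used to correct their bias. lam=1.0 is classical PPI;
    lam=0.0 ignores the predictions and reduces to the plain sample mean.
    """
    # Rectifier: average gap between gold labels and (scaled) predictions.
    rectifier = np.mean(y_labeled - lam * f_labeled)
    return lam * np.mean(f_unlabeled) + rectifier

# Toy example: predictions are systematically biased upward by 0.2.
rng = np.random.default_rng(0)
theta = 0.5                                          # true mean
y_lab = rng.normal(theta, 0.1, size=50)              # 50 expensive labels
f_lab = y_lab + 0.2 + rng.normal(0, 0.05, size=50)   # biased predictions
f_unlab = rng.normal(theta + 0.2, 0.1, size=5000)    # 5000 cheap predictions
est = ppi_mean(y_lab, f_lab, f_unlab)
```

Because the rectifier cancels the predictor's bias, the estimate stays centered on the true mean while inheriting the low variance of the large unlabeled sample.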

Paper Structure

This paper contains 66 sections, 23 theorems, 147 equations, 17 figures, and 1 table.

Key Result

Theorem 2

For all $\Sigma\succ0$, the MultiPPI estimator attains the minimax-optimal variance, where the variance is taken with respect to any distribution $P\in\mathcal{P}_\Sigma$.

Figures (17)

  • Figure 1: Results by budget for the experiments on Chatbot Arena (a), ProcessBench (b), and Factuality (c). For each estimator (all baselines and MultiPPI), the left column plots the empirical coverage of the 95% CI, the middle column plots the width of the 95% CI, and the right column plots the empirical mean-squared error of the point estimate. The fully-labeled sample size $N$ is 250.
  • Figure 2: Proportion of budget allocated to different models in Experiment 1: Chatbot Arena. Gemini 2.5 Flash, the cheapest model, is most sampled in the low-budget regime, while the proportion of budget allocated to the joint (both models combined) increases monotonically with budget.
  • Figure 3: Proportion of budget allocated to different models in Experiment 2: ProcessBench. Tiny (125 word thinking budget) is most sampled in the low-budget regime, while the proportion of budget allocated to the joint (all models combined) increases monotonically with budget.
  • Figure 4: Linear parameters $\lambda_I$ learned across budget regimes in Experiment 2: ProcessBench. While only the tiny model (125 word thinking budget) has a nonzero linear parameter in the low-budget regime, a cascading behavior is learned in the large-budget regime: the cheaper models are prescribed the opposite sign from the more-expensive models in the joint term.
  • Figure 5: Linear parameters $\lambda_I$ learned across budget regimes in Experiment 1: Chatbot Arena. While only Gemini 2.5 Pro has a nonzero linear parameter in the low-budget regime, a cascading behavior is learned in the large-budget regime: the cheaper model (Gemini 2.5 Flash) is prescribed the opposite sign from the more-expensive model (Gemini 2.5 Pro) in the joint term.
  • ...and 12 more figures
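The linear parameters $\lambda_I$ described in Figures 4 and 5 weight each model's predictions in the combined estimator. A minimal sketch of the underlying idea, under the assumption that the coefficients are chosen to minimize variance as in a multivariate control variate (the analogue of the tuned coefficient in PPI-style estimators, not the paper's budget-adaptive $\lambda_I$), is:

```python
import numpy as np

def optimal_lambda(y, F):
    """Variance-minimizing linear coefficients for k predictors.

    y: (n,) gold labels; F: (n, k) predictions from k models on the
    labeled set. The control-variate optimum is
        lambda* = Cov(F, F)^{-1} Cov(F, y),
    i.e., the coefficients of the least-squares projection of y onto
    the (centered) predictions.
    """
    Fc = F - F.mean(axis=0)            # center each model's predictions
    yc = y - y.mean()                  # center the labels
    cov_FF = Fc.T @ Fc / len(y)        # k x k predictor covariance
    cov_Fy = Fc.T @ yc / len(y)        # k-vector of cross-covariances
    return np.linalg.solve(cov_FF, cov_Fy)

# Toy example: model 1 tracks y closely, model 2 is pure noise.
rng = np.random.default_rng(1)
y = rng.normal(size=200)
F = np.column_stack([y + 0.1 * rng.normal(size=200),
                     rng.normal(size=200)])
lam = optimal_lambda(y, F)
```

In this toy setup the informative model receives a coefficient near 1 while the uninformative model's coefficient shrinks toward 0, mirroring how correlation structure drives the learned weights in the figures above.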

Theorems & Definitions (46)

  • Definition 1
  • Theorem 2: Minimax optimality of MultiPPI for known $\Sigma$
  • Theorem 3
  • Theorem 4: Stability of MultiPPI
  • Theorem 5
  • Corollary 1
  • Corollary 2
  • Corollary 3
  • Theorem 6: Finite-sample bounds specialized to Ledoit-Wolf shrinkage
  • Theorem 7
  • ...and 36 more