Data-driven Error Estimation: Upper Bounding Multiple Errors without Class Complexity as Input
Sanath Kumar Krishnamurthy, Anna Lyubarskaja, Emma Brunskill, Susan Athey
TL;DR
This work tackles the problem of constructing high-probability upper bounds on the maximum estimation error across a class of estimation tasks without requiring explicit bounds on class complexity. It introduces a data-driven framework that uses a defining dataset to form per-task estimates and a separate holdout error dataset to bound each individual error, then yields a bound on the maximum error via the maximum of these per-task bounds, independent of the class size. The core theory shows that this bound holds with probability at least $1-\delta$ and naturally adapts to the correlation structure of errors, enabling simultaneous confidence intervals, excess-risk estimation, and potentially optimal model sets without traditional complexity terms. Localization further tightens bounds for particular instances, and the paper provides practical recipes for constructing multiple mean CIs and for contextual-bandit pipelines that rely on error estimates. Overall, the approach offers a simple, flexible, and computation-efficient alternative to union bounds, bootstrap, or PAC-Bayes in large or infinite hypothesis spaces.
Abstract
Constructing confidence intervals that are simultaneously valid across a class of estimates is central for tasks such as multiple mean estimation, bounding generalization error in machine learning, and adaptive experimental design. We frame this as an "error estimation problem," where the goal is to determine a high-probability upper bound on the maximum error for a class of estimates. We propose an entirely data-driven approach that derives such bounds for both finite and infinite class settings, naturally adapting to a potentially unknown correlation structure of random errors. Notably, our method does not require class complexity as an input, overcoming a major limitation of existing approaches such as union bounding and bounds based on Talagrand's inequality. In this paper, we present our simple yet general solution and demonstrate its flexibility through applications ranging from constructing multiple simultaneously valid confidence intervals to optimizing exploration in contextual bandit algorithms.
