Data-light Uncertainty Set Merging with Admissibility
Shenghao Qin, Jianliang He, Qi Kuang, Bowen Gang, Yin Xia
TL;DR
This work addresses merging multiple uncertainty sets when only the initial sets and their miscoverage controls are available, possibly with dependence. It introduces SAT, a data-light framework that converts set-merging into aggregating synthetic statistics (e-values or p-values) and then inverting a hypothesis test to form a merged set with finite-sample coverage guarantees. The key theoretical contributions include admissibility results for deterministic SAT (with general dependence) and valid aggregation schemes, plus practical implementations for infinite candidate spaces. Through extensive simulations and a real-data application to ImageNet_val, SAT demonstrates reliable coverage while delivering smaller merged sets compared to individual inputs, highlighting its potential for distributed, privacy-preserving, or algorithm-ensemble uncertainty merging. The method provides a principled, flexible toolkit for integrating diverse conformal- or confidence-based uncertainty sets under limited data access and varying dependencies.
Abstract
This article introduces a Synthetics, Aggregation, and Test inversion (SAT) approach for merging diverse and potentially dependent uncertainty sets into a single unified set. The procedure is data-light, relying only on initial sets and their nominal levels, and it flexibly adapts to user-specified input sets with possibly varying coverage guarantees. SAT is motivated by the challenge of integrating uncertainty sets when only the initial sets and their control levels are available-for example, when merging confidence sets from distributed sites under communication constraints or combining conformal prediction sets generated by different algorithms or data splits. To address this, SAT constructs and aggregates novel synthetic test statistics, and then derive merged sets through test inversion. Our method leverages the duality between set estimation and hypothesis testing, ensuring reliable coverage in dependent scenarios. A key theoretical contribution is a rigorous analysis of SAT's properties, including its admissibility in the context of deterministic set merging. Both theoretical analyses and empirical results confirm the method's finite-sample coverage validity and desirable set sizes.
