Inverse set estimation and inversion of simultaneous confidence intervals
Junting Ren, Fabian J. E. Telschow, Armin Schwartzman
TL;DR
This work develops a finite-sample framework for inverse set estimation by inverting pre-built simultaneous confidence intervals (SCIs). By constructing inner and outer confidence sets for inverse upper, inverse lower, and inverse interval excursion sets, the authors guarantee exact coverage for all levels $c$ in $\mathbb R$ (via the SCI) or conservatively for finite level collections, thereby enabling non-asymptotic, level-flexible inference on $\mu^{-1}(U)$ even on non-dense domains. A non-parametric bootstrap SCI for regression, along with accompanying R code, extends the methodology to linear and logistic regression and to high-dimensional coefficient settings, with robust finite-sample performance demonstrated through comprehensive simulations. The approach is applied to climate risk mapping and to prediction uncertainty in COVID outcomes with statin use, illustrating how policymakers or clinicians can interpret regions or predictor settings that exceed specified thresholds with rigorous error control across many levels. Overall, the paper offers a broadly applicable, data-agnostic toolkit for simultaneous inverse-set inference that accommodates diverse data modalities while guarding against data peeking and multiple-threshold cherry-picking.
Abstract
Motivated by the questions of risk assessment in climatology (temperature change in North America) and medicine (impact of statin usage and COVID-19 on hospitalized patients), we address the problem of estimating the set in the domain of a function whose image equals a predefined subset. Existing methods that construct confidence sets require strict assumptions. We generalize the estimation of such sets to dense and non-dense domains with protection against "data peeking" by proving that confidence sets of multiple levels can be simultaneously constructed with the desired confidence non-asymptotically through inverting simultaneous confidence bands. A non-parametric bootstrap algorithm and code are provided.
