Table of Contents
Fetching ...

Uncertainty quantification for intervals

Carlos García Meixide, Michael R. Kosorok, Marcos Matabuena

TL;DR

This work addresses uncertainty quantification for interval-censored outcomes by introducing uncervals, a framework that blends conformal inference with bootstrap to construct predictive regions for interval targets. The authors develop a novel interval-process theory with a Donsker-class guarantee, unbiasedness, and bootstrap validity, yielding finite-sample calibration and asymptotic coverage results. Simulations show substantial improvements in conditional coverage (up to around 60%) and demonstrate robustness across scenarios, while real-data applications (sleep, age, and physical activity) illustrate practical gains for interval data in healthcare. The approach provides a general, model-compatible tool for reliable interval-target predictions and goodness-of-fit assessments in settings where exact event times are unknown, with potential extensions to multivariate and truncated data. Overall, uncervals advances uncertainty quantification for interval-censored data, offering theoretical guarantees and actionable insights for precision medicine and digital health.

Abstract

Data following an interval structure are increasingly prevalent in many scientific applications. In medicine, clinical events are often monitored between two clinical visits, making the exact time of the event unknown and generating outcomes with a range format. As interest in automating healthcare decisions grows, uncertainty quantification via predictive regions becomes essential for developing reliable and trustworthy predictive algorithms. However, the statistical literature currently lacks a general methodology for interval targets, especially when these outcomes are incomplete due to censoring. We propose an uncertainty quantification algorithm for interval responses and establish its theoretical properties using empirical process arguments based on a newly developed class of functions specifically designed for these interval data structures. Although this paper primarily focuses on deriving predictive regions for interval-censored data, the approach can also be applied to other statistical modeling tasks, such as goodness-of-fit assessments. Finally, the applicability of the method is demonstrated through simulations, showing up to a 60\% improvement in conditional coverage. Our new algorithm is also applied to various biomedical contexts, including two clinical examples: i) sleep duration and its association with cardiovascular diseases, and ii) survival time in relation to physical activity levels.

Uncertainty quantification for intervals

TL;DR

This work addresses uncertainty quantification for interval-censored outcomes by introducing uncervals, a framework that blends conformal inference with bootstrap to construct predictive regions for interval targets. The authors develop a novel interval-process theory with a Donsker-class guarantee, unbiasedness, and bootstrap validity, yielding finite-sample calibration and asymptotic coverage results. Simulations show substantial improvements in conditional coverage (up to around 60%) and demonstrate robustness across scenarios, while real-data applications (sleep, age, and physical activity) illustrate practical gains for interval data in healthcare. The approach provides a general, model-compatible tool for reliable interval-target predictions and goodness-of-fit assessments in settings where exact event times are unknown, with potential extensions to multivariate and truncated data. Overall, uncervals advances uncertainty quantification for interval-censored data, offering theoretical guarantees and actionable insights for precision medicine and digital health.

Abstract

Data following an interval structure are increasingly prevalent in many scientific applications. In medicine, clinical events are often monitored between two clinical visits, making the exact time of the event unknown and generating outcomes with a range format. As interest in automating healthcare decisions grows, uncertainty quantification via predictive regions becomes essential for developing reliable and trustworthy predictive algorithms. However, the statistical literature currently lacks a general methodology for interval targets, especially when these outcomes are incomplete due to censoring. We propose an uncertainty quantification algorithm for interval responses and establish its theoretical properties using empirical process arguments based on a newly developed class of functions specifically designed for these interval data structures. Although this paper primarily focuses on deriving predictive regions for interval-censored data, the approach can also be applied to other statistical modeling tasks, such as goodness-of-fit assessments. Finally, the applicability of the method is demonstrated through simulations, showing up to a 60\% improvement in conditional coverage. Our new algorithm is also applied to various biomedical contexts, including two clinical examples: i) sleep duration and its association with cardiovascular diseases, and ii) survival time in relation to physical activity levels.
Paper Structure (34 sections, 15 theorems, 55 equations, 10 figures, 12 tables, 1 algorithm)

This paper contains 34 sections, 15 theorems, 55 equations, 10 figures, 12 tables, 1 algorithm.

Key Result

Theorem 1

Set $e=0$, or equivalently, just consider the evaluations of $\hat{F}_1$ on left borders. Suppose that the data are i.i.d. or exchangeable and that the estimator of the conditional survival function is invariant to permutations of the data. Then $\widehat{\mathcal{C}}_{1-\alpha}^{ 0}(\cdot)$ is an L

Figures (10)

  • Figure 1: Illustration of a target variable with an interval structure. Each segment represents a patient. The borders of the black lines indicate medical checkpoints that bound the true, unknown time-to-event. The light grey segments represent right-censored patients, while the dark grey segments represent left-censored patients.
  • Figure 2: Failure of uncertainty quantification approaches for interval data
  • Figure 3: Combinatorial ground of Theorem \ref{['th_donsker']}: an illustration of how the shuttering principle of VC theory acts at the core of the arguments encountered in its proof.
  • Figure 4: Summary statistics for err$\cdot 100$.
  • Figure 5: Boxplots for the distribution of err$\cdot 100$ across simulation runs. Additional cases in the Supplement.
  • ...and 5 more figures

Theorems & Definitions (22)

  • Theorem 1: Finite-sample validity of uncervals
  • Remark 1
  • Definition 1
  • Remark 2
  • Theorem 2
  • Proposition 1
  • Proposition 2
  • Corollary 1
  • Corollary 2
  • Remark 3
  • ...and 12 more