Confidence Interval Construction and Conditional Variance Estimation with Dense ReLU Networks
Carlos Misael Madrid Padilla, Oscar Hernan Madrid Padilla, Yik Lun Kei, Zhi Zhang, Yanzhen Chen
TL;DR
The paper tackles conditional variance estimation and confidence interval construction in nonparametric regression using dense ReLU networks. It proposes a residual-based framework that yields non-asymptotic bounds under heteroscedastic and homoscedastic noise, relaxing sub-Gaussian assumptions to sub-Exponential. For ReLU estimators, the work derives non-asymptotic bounds for both the conditional mean $f^*(x)$ and the conditional variance $g^*(x)$, marking the first variance-estimation results for ReLU networks, and introduces a bootstrap-based confidence interval with guaranteed coverage. Empirical results on simulations and a real California housing dataset demonstrate improved variance estimation accuracy and reliable, efficient confidence intervals compared to strong baselines.
Abstract
This paper addresses the problems of conditional variance estimation and confidence interval construction in nonparametric regression using dense networks with the Rectified Linear Unit (ReLU) activation function. We present a residual-based framework for conditional variance estimation, deriving nonasymptotic bounds for variance estimation under both heteroscedastic and homoscedastic settings. We relax the sub-Gaussian noise assumption, allowing the proposed bounds to accommodate sub-Exponential noise and beyond. Building on this, for a ReLU neural network estimator, we derive non-asymptotic bounds for both its conditional mean and variance estimation, representing the first result for variance estimation using ReLU networks. Furthermore, we develop a ReLU network based robust bootstrap procedure (Efron, 1992) for constructing confidence intervals for the true mean that comes with a theoretical guarantee on the coverage, providing a significant advancement in uncertainty quantification and the construction of reliable confidence intervals in deep learning settings.
