High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization
Wanrong Zhu, Zhipeng Lou, Ziyang Wei, Wei Biao Wu
TL;DR
This work addresses uncertainty quantification for online stochastic optimization by introducing a parallel-run inference framework that builds a $t$-based confidence interval for any linear functional $\upsilon^{\top}x^{*}$ using $K$ independent runs. The method updates a parallel average $\bar{x}_{K,n}$ and a variance surrogate $\widehat{\sigma}_{\upsilon}^{2}$ to form $\widehat{CI}_{\upsilon}=\left[\upsilon^{\top}\bar{x}_{K,n}-\frac{t_{1-\alpha/2,K-1}\widehat{\sigma}_{\upsilon}}{\sqrt{K}},\ \upsilon^{\top}\bar{x}_{K,n}+\frac{t_{1-\alpha/2,K-1}\widehat{\sigma}_{\upsilon}}{\sqrt{K}}\right]$, with the corresponding $t$-statistic converging to $t_{K-1}$. The theoretical core is a Gaussian approximation for online estimators that yields explicit rates for the relative coverage error $\Delta_{\alpha}$ and shows the $t$-statistic is asymptotically pivotal, enabling valid high-confidence inference in online settings. Empirical results on linear and logistic regression, plus a MNIST-based mean image task, demonstrate accurate coverage, competitive interval lengths, and substantial computational savings from the near 'cost-free' inference, especially when leveraging parallel computing. The approach is easily integrated into existing stochastic algorithms and is well-suited to large-scale, streaming, or federated contexts where parallelism is natural.
Abstract
Uncertainty quantification for estimation through stochastic optimization solutions in an online setting has gained popularity recently. This paper introduces a novel inference method focused on constructing confidence intervals with efficient computation and fast convergence to the nominal level. Specifically, we propose to use a small number of independent multi-runs to acquire distribution information and construct a t-based confidence interval. Our method requires minimal additional computation and memory beyond the standard updating of estimates, making the inference process almost cost-free. We provide a rigorous theoretical guarantee for the confidence interval, demonstrating that the coverage is approximately exact with an explicit convergence rate and allowing for high confidence level inference. In particular, a new Gaussian approximation result is developed for the online estimators to characterize the coverage properties of our confidence intervals in terms of relative errors. Additionally, our method also allows for leveraging parallel computing to further accelerate calculations using multiple cores. It is easy to implement and can be integrated with existing stochastic algorithms without the need for complicated modifications.
