Online Covariance Matrix Estimation in Sketched Newton Methods
Wei Kuang, Mihai Anitescu, Sen Na
TL;DR
This work develops an online, fully online covariance estimator for sketched Newton methods in stochastic optimization, enabling online inference from second-order updates without costly Hessian inversions. By leveraging a weighted, batch-free covariance estimator constructed from Newton iterates, the authors establish consistency and convergence rates, and show that online confidence intervals can be built directly from the iterates. The method remains efficient even with sketching and adaptive step sizes, and extends naturally to constrained problems and sketched SQP. Empirical results on linear and logistic regression, as well as CUTEst benchmarks, demonstrate accurate covariance estimation and reliable confidence intervals, highlighting the practical impact for streaming, large-scale, second-order online learning.
Abstract
Given the ubiquity of streaming data, online algorithms have been widely used for parameter estimation, with second-order methods particularly standing out for their efficiency and robustness. In this paper, we study an online sketched Newton method that leverages a randomized sketching technique to perform an approximate Newton step in each iteration, thereby eliminating the computational bottleneck of second-order methods. While existing studies have established the asymptotic normality of sketched Newton methods, a consistent estimator of the limiting covariance matrix remains an open problem. We propose a fully online covariance matrix estimator that is constructed entirely from the Newton iterates and requires no matrix factorization. Compared to covariance estimators for first-order online methods, our estimator for second-order methods is batch-free. We establish the consistency and convergence rate of our estimator, and coupled with asymptotic normality results, we can then perform online statistical inference for the model parameters based on sketched Newton methods. We also discuss the extension of our estimator to constrained problems, and demonstrate its superior performance on regression problems as well as benchmark problems in the CUTEst set.
