Online estimation of the inverse of the Hessian for stochastic optimization with application to universal stochastic Newton algorithms
Antoine Godichon-Baggioni, Wei Lu, Bruno Portier
TL;DR
The paper tackles online second-order stochastic optimization by directly estimating the inverse Hessian $H^{-1}$ via a Robbins-Monro recursion, avoiding explicit Hessian inversion and achieving $\mathcal{O}(d^2)$ per-iteration complexity with a randomized update using $Z_n$. It introduces the Universal Stochastic Newton Algorithm (USNA) and its weighted averaged variant UWASNA, proving consistency, convergence rates, and asymptotic efficiency for the parameter $\theta$, as well as rates for the Hessian inverse estimates. Through extensive simulations and real-data experiments, the methods demonstrate competitive performance against Riccati-based stochastic Newton algorithms and provide viable options when Riccati updates are infeasible (e.g., spherical distributions, $p$-means). The results highlight the practical impact of a Riccati-free, online second-order approach for diverse stochastic optimization problems, including logistic regression, geometric median, and higher-order statistical functionals. The framework is supported by rigorous proofs detailing convergence, rate results, and stability properties under clearly stated assumptions.
Abstract
This paper addresses second-order stochastic optimization for estimating the minimizer of a convex function written as an expectation. A direct recursive estimation technique for the inverse Hessian matrix using a Robbins-Monro procedure is introduced. This approach enables to drastically reduces computational complexity. Above all, it allows to develop universal stochastic Newton methods and investigate the asymptotic efficiency of the proposed approach. This work so expands the application scope of secondorder algorithms in stochastic optimization.
