A Stochastic Newton-type Method for Non-smooth Optimization
Titus Pinta
TL;DR
This work develops a stochastic Newton-type framework for non-smooth optimization in which randomness enters solely through the Hessian approximations. Using tools from stochastic processes together with a backtracking step, it derives expectation and tail bounds on the number of iterations needed to reach approximate first-order optimality, without requiring the Hessian estimator to be unbiased or to have finite variance. It demonstrates practical effectiveness in XFEL tomography and image denoising, using random-noise and sketching approaches respectively. The results show that stochastic Quasi-Newton methods can outperform traditional first-order methods in large-scale or physics-driven settings, with robust convergence guarantees under realistic regularity assumptions. Overall, the paper broadens the applicability of Newton-type methods to stochastic, non-smooth, large-scale problems, offering rigorous performance guarantees and versatile algorithmic templates.
Abstract
We introduce a new framework for analyzing (Quasi-)Newton-type methods applied to non-smooth optimization problems. The randomness comes from the evaluation of the Hessian (or of its approximation). Using a variant of Chernoff bounds for stopping times, we derive expectation and probability bounds for the random variable representing the number of iterations the algorithm takes until approximate first-order optimality conditions are satisfied. In contrast to previous results in the literature, we do not require the estimator to be unbiased or to have finite variance. We then showcase our theoretical results in a stochastic Quasi-Newton method for X-ray free-electron laser orbital tomography and in a sketched Newton method for image denoising.
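To make the quantity being bounded concrete, the following is a minimal illustrative sketch, not the algorithm from the paper. It assumes a smooth, strongly convex quadratic test problem (a simplification of the non-smooth setting), a symmetric Gaussian perturbation as the source of Hessian noise, and an Armijo backtracking rule. The function name `noisy_newton` and all parameter values are hypothetical. The returned iteration count is the stopping time: the first iteration at which the gradient norm drops below the tolerance.

```python
import numpy as np

def noisy_newton(f, grad, hess, x0, eps=1e-6, noise=0.05, max_iter=500, seed=0):
    """Newton-type iteration with a randomly perturbed Hessian and backtracking.

    Returns the final iterate and the number of iterations taken until
    ||grad(x)|| <= eps (or max_iter if the tolerance was never reached).
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) <= eps:
            return x, k  # approximate first-order optimality reached
        # Assumption: randomness enters only through the Hessian estimate.
        E = rng.standard_normal((x.size, x.size))
        H_hat = hess(x) + noise * (E + E.T) / 2
        try:
            d = -np.linalg.solve(H_hat, g)
        except np.linalg.LinAlgError:
            d = -g
        if g @ d >= 0:
            d = -g  # fall back to steepest descent if d is not a descent direction
        # Armijo backtracking on the step length.
        t, fx = 1.0, f(x)
        while f(x + t * d) > fx + 1e-4 * t * (g @ d) and t > 1e-12:
            t *= 0.5
        x = x + t * d
    return x, max_iter

# Hypothetical test problem: f(x) = 0.5 x^T A x - b^T x with A positive definite.
A = np.diag([1.0, 2.0, 4.0, 8.0])
b = np.ones(4)
f = lambda x: 0.5 * x @ A @ x - b @ x
grad = lambda x: A @ x - b
hess = lambda x: A

x_final, n_iters = noisy_newton(f, grad, hess, np.zeros(4))
print("iterations until stopping:", n_iters)
```

Running the sketch with different seeds gives different iteration counts; the expectation and probability bounds described in the abstract concern the distribution of this count.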
