A linesearch-type normal map-based semismooth Newton method for nonsmooth nonconvex composite optimization
Hanfeng Zeng, Wenqing Ouyang, Andre Milzarek
TL;DR
This paper develops a linesearch-type normal-map semismooth Newton method for nonsmooth, nonconvex composite optimization of the form $\psi(x)=f(x)+\varphi(x)$, where $f$ is smooth and $\varphi$ is convex and lsc. It introduces an adaptive Lipschitz estimation strategy to avoid explicit $L$ computations and proves global convergence, convergence under the Kurdyka-Łojasiewicz property, and local q-superlinear convergence. The algorithm relies on a symmetric linear system derived from the normal map via $F^{\lambda}_{\mathrm{nor}}(z)$ and a CG-based inexact solve, with a linesearch ensuring descent of a merit function $H(\tau,z)$ and a smooth transition to fast local convergence. Numerical experiments on sparse logistic regression, nonlinear image compression, and group-sparse nonlinear least squares demonstrate competitive performance and robustness, highlighting practical impact for large-scale nonsmooth optimization.
Abstract
We propose a novel linesearch variant of the trust region normal map-based semismooth Newton method developed in [Ouyang and Milzarek, Math. Program. 212(1-2), 389--435 (2025)] for solving a class of nonsmooth, nonconvex composite-type optimization problems. Our approach uses adaptive parameter estimation techniques, which allow us to avoid explicit and potentially expensive Lipschitz constant computations. We provide extensive convergence results including global convergence, convergence of the iterates under the Kurdyka-Łojasiewicz inequality, and transition to fast local q-superlinear convergence. Compared to the original trust region framework, the linesearch-based algorithm is simpler and the overall convergence analysis can be conducted under weaker assumptions -- in particular, without requiring explicit boundedness conditions on the Hessian approximations and iterates. Numerical experiments on sparse logistic regression, image compression, and nonlinear least squares with group penalty terms demonstrate the efficiency of the proposed approach.
