A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

Chuan He; Heng Huang; Zhaosong Lu

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

Chuan He, Heng Huang, Zhaosong Lu

TL;DR

This work develops a Newton-CG based barrier-augmented Lagrangian framework for general nonconvex conic optimization, aiming to compute approximate second-order stationary points. By integrating a preconditioned Newton-CG subsolver with barrier and augmented-Lagrangian techniques, the authors derive worst-case iteration and operation complexity bounds, including $\widetilde{\mathcal{O}}(\varepsilon^{-11/2})$ inner iterations (and $\widetilde{\mathcal{O}}(\varepsilon^{-7/2})$ under a constraint qualification) to obtain an $(\varepsilon,\sqrt{\varepsilon})$-SOSP. The method is backed by a rigorous analysis of optimality conditions under a verifiable CQ and a comprehensive complexity framework for the subproblem solver and AL outer loop; preliminary numerical results on low-rank recovery and NMF problems show improved solution quality over first-order methods. Overall, the paper advances the theory and practice of scalable second-order methods for nonconvex conic programs, offering practical algorithms with provable guarantees for challenging constrained settings.

Abstract

In this paper we consider finding an approximate second-order stationary point (SOSP) of general nonconvex conic optimization that minimizes a twice differentiable function subject to nonlinear equality constraints and also a convex conic constraint. In particular, we propose a Newton-conjugate gradient (Newton-CG) based barrier-augmented Lagrangian method for finding an approximate SOSP of this problem. Under some mild assumptions, we show that our method enjoys a total inner iteration complexity of $\widetilde{\cal O}(ε^{-11/2})$ and an operation complexity of $\widetilde{\cal O}(ε^{-11/2}\min\{n,ε^{-5/4}\})$ for finding an $(ε,\sqrtε)$-SOSP of general nonconvex conic optimization with high probability. Moreover, under a constraint qualification, these complexity bounds are improved to $\widetilde{\cal O}(ε^{-7/2})$ and $\widetilde{\cal O}(ε^{-7/2}\min\{n,ε^{-3/4}\})$, respectively. To the best of our knowledge, this is the first study on the complexity of finding an approximate SOSP of general nonconvex conic optimization. Preliminary numerical results are presented to demonstrate superiority of the proposed method over first-order methods in terms of solution quality.

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

TL;DR

inner iterations (and

under a constraint qualification) to obtain an

-SOSP. The method is backed by a rigorous analysis of optimality conditions under a verifiable CQ and a comprehensive complexity framework for the subproblem solver and AL outer loop; preliminary numerical results on low-rank recovery and NMF problems show improved solution quality over first-order methods. Overall, the paper advances the theory and practice of scalable second-order methods for nonconvex conic programs, offering practical algorithms with provable guarantees for challenging constrained settings.

Abstract

and an operation complexity of

for finding an

-SOSP of general nonconvex conic optimization with high probability. Moreover, under a constraint qualification, these complexity bounds are improved to

and

, respectively. To the best of our knowledge, this is the first study on the complexity of finding an approximate SOSP of general nonconvex conic optimization. Preliminary numerical results are presented to demonstrate superiority of the proposed method over first-order methods in terms of solution quality.

Paper Structure (21 sections, 21 theorems, 116 equations, 6 figures, 3 tables, 4 algorithms)

This paper contains 21 sections, 21 theorems, 116 equations, 6 figures, 3 tables, 4 algorithms.

Introduction
Notation and preliminaries
Optimality conditions
A preconditioned Newton-CG method for barrier problems
Iteration and operation complexity of Algorithm \ref{['alg:NCG']}
A Newton-CG based barrier-AL method for problem \ref{['model:equa-cnstr']}
A Newton-CG based barrier-AL method
Outer iteration complexity of Algorithm \ref{['alg:2nd-order-AL-nonconvex']}
Total inner iteration and operation complexity of Algorithm \ref{['alg:2nd-order-AL-nonconvex']}
Enhanced complexity of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} under constraint qualification
Numerical results
Low-rank matrix recovery
A simplex-constrained nonnegative matrix factorization
A sphere-constrained nonnegative matrix factorization
Proof of the main results
...and 6 more sections

Key Result

Lemma 2.1

For any $x\in\mathop{\mathrm{\mathcal{K}}}\nolimits$, it holds that $\{x+M^{1/2}d:\|d\|<1\}\subseteq\mathop{\mathrm{\mathcal{K}}}\nolimits$ for all $M\in\nabla^{-2} B(x)$.

Figures (6)

Figure 1: Left: The total number of inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} for finding a $(10^{-4},10^{-2})$-SOSP of \ref{['ms-reform']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA for finding a $10^{-4}$-FOSP of \ref{['ms']} for each problem size over 10 random instances.
Figure 2: Numerical results of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} and SpaRSA on a single random instance of problem \ref{['ms']} with $(n,l,m)=(20,2,80)$. These two figures illustrate the convergence behavior of both methods in terms of objective value $\frac{1}{2}\|\mathop{\mathrm{\mathcal{A}}}\nolimits(U^t(U^t)^T)-y\|^2$ and feasibility $[\|U^t\|_F^2-b]_+$.
Figure 3: Left: The total inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} before finding a $(10^{-4},10^{-2})$-SOSP of \ref{['nmf']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA before finding a $10^{-4}$-FOSP of \ref{['nmf']} for each problem size over 10 random instances.
Figure 4: Numerical results of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} and SpaRSA on a single random instance of problem \ref{['nmf']} with $(n,l,m)=(20,2,20)$. These two figures illustrate the convergence behavior of both methods in terms of objective value $\frac{1}{2}\|X-U^tV^t\|_F^2+\gamma(\|U^t\|_F^2+\|V^t\|_F^2)$ and feasibility $\|(V^t)^Te_l-e_m\|$.
Figure 5: Left: The total inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} before finding a $(10^{-4},10^{-2})$-SOSP of \ref{['sph-nmf']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA before finding a $10^{-4}$-FOSP of \ref{['sph-nmf']} for each problem size over 10 random instances.
...and 1 more figures

Theorems & Definitions (45)

Lemma 2.1
Theorem 3.1: first- and second-order optimality conditions
Remark 3.1
Definition 3.1: $\epsilon_1$-first-order stationary point
Definition 3.2: $(\epsilon_1,\epsilon_2)$-second-order stationary point
Remark 3.2
Theorem 4.1: Complexity of Algorithm \ref{['alg:NCG']}
Remark 5.1
Remark 5.2
Lemma 5.1: Properties of $\mathop{\mathrm{\mathcal{L}}}\nolimits_{\mu_k}(\cdot,\lambda^k;\rho_k)$ and $\mathop{\mathrm{\mathcal{L}}}\nolimits(\cdot,\lambda^k;\rho_k)$
...and 35 more

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

TL;DR

Abstract

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (45)