Table of Contents
Fetching ...

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

Chuan He, Heng Huang, Zhaosong Lu

TL;DR

This work develops a Newton-CG based barrier-augmented Lagrangian framework for general nonconvex conic optimization, aiming to compute approximate second-order stationary points. By integrating a preconditioned Newton-CG subsolver with barrier and augmented-Lagrangian techniques, the authors derive worst-case iteration and operation complexity bounds, including $\widetilde{\mathcal{O}}(\varepsilon^{-11/2})$ inner iterations (and $\widetilde{\mathcal{O}}(\varepsilon^{-7/2})$ under a constraint qualification) to obtain an $(\varepsilon,\sqrt{\varepsilon})$-SOSP. The method is backed by a rigorous analysis of optimality conditions under a verifiable CQ and a comprehensive complexity framework for the subproblem solver and AL outer loop; preliminary numerical results on low-rank recovery and NMF problems show improved solution quality over first-order methods. Overall, the paper advances the theory and practice of scalable second-order methods for nonconvex conic programs, offering practical algorithms with provable guarantees for challenging constrained settings.

Abstract

In this paper we consider finding an approximate second-order stationary point (SOSP) of general nonconvex conic optimization that minimizes a twice differentiable function subject to nonlinear equality constraints and also a convex conic constraint. In particular, we propose a Newton-conjugate gradient (Newton-CG) based barrier-augmented Lagrangian method for finding an approximate SOSP of this problem. Under some mild assumptions, we show that our method enjoys a total inner iteration complexity of $\widetilde{\cal O}(ε^{-11/2})$ and an operation complexity of $\widetilde{\cal O}(ε^{-11/2}\min\{n,ε^{-5/4}\})$ for finding an $(ε,\sqrtε)$-SOSP of general nonconvex conic optimization with high probability. Moreover, under a constraint qualification, these complexity bounds are improved to $\widetilde{\cal O}(ε^{-7/2})$ and $\widetilde{\cal O}(ε^{-7/2}\min\{n,ε^{-3/4}\})$, respectively. To the best of our knowledge, this is the first study on the complexity of finding an approximate SOSP of general nonconvex conic optimization. Preliminary numerical results are presented to demonstrate superiority of the proposed method over first-order methods in terms of solution quality.

A Newton-CG based barrier-augmented Lagrangian method for general nonconvex conic optimization

TL;DR

This work develops a Newton-CG based barrier-augmented Lagrangian framework for general nonconvex conic optimization, aiming to compute approximate second-order stationary points. By integrating a preconditioned Newton-CG subsolver with barrier and augmented-Lagrangian techniques, the authors derive worst-case iteration and operation complexity bounds, including inner iterations (and under a constraint qualification) to obtain an -SOSP. The method is backed by a rigorous analysis of optimality conditions under a verifiable CQ and a comprehensive complexity framework for the subproblem solver and AL outer loop; preliminary numerical results on low-rank recovery and NMF problems show improved solution quality over first-order methods. Overall, the paper advances the theory and practice of scalable second-order methods for nonconvex conic programs, offering practical algorithms with provable guarantees for challenging constrained settings.

Abstract

In this paper we consider finding an approximate second-order stationary point (SOSP) of general nonconvex conic optimization that minimizes a twice differentiable function subject to nonlinear equality constraints and also a convex conic constraint. In particular, we propose a Newton-conjugate gradient (Newton-CG) based barrier-augmented Lagrangian method for finding an approximate SOSP of this problem. Under some mild assumptions, we show that our method enjoys a total inner iteration complexity of and an operation complexity of for finding an -SOSP of general nonconvex conic optimization with high probability. Moreover, under a constraint qualification, these complexity bounds are improved to and , respectively. To the best of our knowledge, this is the first study on the complexity of finding an approximate SOSP of general nonconvex conic optimization. Preliminary numerical results are presented to demonstrate superiority of the proposed method over first-order methods in terms of solution quality.
Paper Structure (21 sections, 21 theorems, 116 equations, 6 figures, 3 tables, 4 algorithms)

This paper contains 21 sections, 21 theorems, 116 equations, 6 figures, 3 tables, 4 algorithms.

Key Result

Lemma 2.1

For any $x\in\mathop{\mathrm{\mathcal{K}}}\nolimits$, it holds that $\{x+M^{1/2}d:\|d\|<1\}\subseteq\mathop{\mathrm{\mathcal{K}}}\nolimits$ for all $M\in\nabla^{-2} B(x)$.

Figures (6)

  • Figure 1: Left: The total number of inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} for finding a $(10^{-4},10^{-2})$-SOSP of \ref{['ms-reform']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA for finding a $10^{-4}$-FOSP of \ref{['ms']} for each problem size over 10 random instances.
  • Figure 2: Numerical results of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} and SpaRSA on a single random instance of problem \ref{['ms']} with $(n,l,m)=(20,2,80)$. These two figures illustrate the convergence behavior of both methods in terms of objective value $\frac{1}{2}\|\mathop{\mathrm{\mathcal{A}}}\nolimits(U^t(U^t)^T)-y\|^2$ and feasibility $[\|U^t\|_F^2-b]_+$.
  • Figure 3: Left: The total inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} before finding a $(10^{-4},10^{-2})$-SOSP of \ref{['nmf']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA before finding a $10^{-4}$-FOSP of \ref{['nmf']} for each problem size over 10 random instances.
  • Figure 4: Numerical results of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} and SpaRSA on a single random instance of problem \ref{['nmf']} with $(n,l,m)=(20,2,20)$. These two figures illustrate the convergence behavior of both methods in terms of objective value $\frac{1}{2}\|X-U^tV^t\|_F^2+\gamma(\|U^t\|_F^2+\|V^t\|_F^2)$ and feasibility $\|(V^t)^Te_l-e_m\|$.
  • Figure 5: Left: The total inner iterations of Algorithm \ref{['alg:2nd-order-AL-nonconvex']} before finding a $(10^{-4},10^{-2})$-SOSP of \ref{['sph-nmf']} for each problem size over 10 random instances. Right: The number of iterations of SpaRSA before finding a $10^{-4}$-FOSP of \ref{['sph-nmf']} for each problem size over 10 random instances.
  • ...and 1 more figures

Theorems & Definitions (45)

  • Lemma 2.1
  • Theorem 3.1: first- and second-order optimality conditions
  • Remark 3.1
  • Definition 3.1: $\epsilon_1$-first-order stationary point
  • Definition 3.2: $(\epsilon_1,\epsilon_2)$-second-order stationary point
  • Remark 3.2
  • Theorem 4.1: Complexity of Algorithm \ref{['alg:NCG']}
  • Remark 5.1
  • Remark 5.2
  • Lemma 5.1: Properties of $\mathop{\mathrm{\mathcal{L}}}\nolimits_{\mu_k}(\cdot,\lambda^k;\rho_k)$ and $\mathop{\mathrm{\mathcal{L}}}\nolimits(\cdot,\lambda^k;\rho_k)$
  • ...and 35 more