Table of Contents
Fetching ...

Adaptive directional decomposition methods for nonconvex constrained optimization

Qiankun Shi, Xiao Wang

TL;DR

The paper addresses nonconvex constrained optimization with both equality and inequality constraints in deterministic and stochastic settings by introducing an adaptive directional decomposition framework that splits descent into tangent-space objective reduction and normal-space constraint-violation reduction, with adaptive merit parameters and stepsizes. The deterministic part yields an $O(\epsilon^{-2})$ iteration complexity to reach an $\epsilon$-KKT point under strong LICQ, while stochastic variants (mini-batch and recursive momentum) achieve state-of-the-art high-probability oracle complexities ($\tilde O(\epsilon^{-4},\epsilon^{-6})$ and $\tilde O(\epsilon^{-3},\epsilon^{-5})$, respectively) under sample-wise smoothness assumptions. The framework is demonstrated to be effective through numerical experiments on CUTEst, with insights into the impact of the user-defined mapping $A$ and variance-reduction techniques, and it extends naturally to inequality-constrained problems via a subproblem enforcing descent in the linearized constraints. Overall, the approach provides a unified, first-order methodology with strong theoretical guarantees and practical performance for a broad class of nonconvex constrained optimization tasks.

Abstract

In this paper, we study nonconvex constrained optimization problems with both equality and inequality constraints, covering deterministic and stochastic settings. We propose a novel first-order algorithm framework that employs a decomposition strategy to balance objective reduction and constraint satisfaction, together with adaptive update of stepsizes and merit parameters. Under certain conditions, the proposed adaptive directional decomposition methods attain an iteration complexity of order \(O(ε^{-2})\) for finding an \(ε\)-KKT point in the deterministic setting. In the stochastic setting, we further develop stochastic variants of approaches and analyze their theoretical properties by leveraging the perturbation theory. We establish the high-probability oracle complexity to find an $ε$-KKT point of order \( \tilde O(ε^{-4}, ε^{-6}) \) (resp. \(\tilde O(ε^{-3}, ε^{-5}) \)) for gradient and constraint evaluations, in the absence (resp. presence) of sample-wise smoothness. To the best of our knowledge, the obtained complexity bounds are comparable to, or improve upon, the state-of-the-art results in the literature.

Adaptive directional decomposition methods for nonconvex constrained optimization

TL;DR

The paper addresses nonconvex constrained optimization with both equality and inequality constraints in deterministic and stochastic settings by introducing an adaptive directional decomposition framework that splits descent into tangent-space objective reduction and normal-space constraint-violation reduction, with adaptive merit parameters and stepsizes. The deterministic part yields an iteration complexity to reach an -KKT point under strong LICQ, while stochastic variants (mini-batch and recursive momentum) achieve state-of-the-art high-probability oracle complexities ( and , respectively) under sample-wise smoothness assumptions. The framework is demonstrated to be effective through numerical experiments on CUTEst, with insights into the impact of the user-defined mapping and variance-reduction techniques, and it extends naturally to inequality-constrained problems via a subproblem enforcing descent in the linearized constraints. Overall, the approach provides a unified, first-order methodology with strong theoretical guarantees and practical performance for a broad class of nonconvex constrained optimization tasks.

Abstract

In this paper, we study nonconvex constrained optimization problems with both equality and inequality constraints, covering deterministic and stochastic settings. We propose a novel first-order algorithm framework that employs a decomposition strategy to balance objective reduction and constraint satisfaction, together with adaptive update of stepsizes and merit parameters. Under certain conditions, the proposed adaptive directional decomposition methods attain an iteration complexity of order \(O(ε^{-2})\) for finding an -KKT point in the deterministic setting. In the stochastic setting, we further develop stochastic variants of approaches and analyze their theoretical properties by leveraging the perturbation theory. We establish the high-probability oracle complexity to find an -KKT point of order \( \tilde O(ε^{-4}, ε^{-6}) \) (resp. \(\tilde O(ε^{-3}, ε^{-5}) \)) for gradient and constraint evaluations, in the absence (resp. presence) of sample-wise smoothness. To the best of our knowledge, the obtained complexity bounds are comparable to, or improve upon, the state-of-the-art results in the literature.

Paper Structure

This paper contains 24 sections, 29 theorems, 155 equations, 3 figures, 1 table, 3 algorithms.

Key Result

Lemma 1

Under Assumption ass:cq, if $s(x) = 0$, then $x$ is a KKT point of Problem p1. Furthermore, if the the minimal singular value of $\nabla c(x) A(x)$, denoted by $\delta$, is positive and $\left \lVert s(x) \right \rVert \leq \epsilon$ for a given $\epsilon>0$, then $\left \lVert {\rm P}_{V(x)} \nabla

Figures (3)

  • Figure 1: Comparison between linearized ALM and SQP.
  • Figure 2: Comparison between Mini-batch and Recursive momentum method.
  • Figure 3: Performance of the recursive momentum approach on CUTEst problems.

Theorems & Definitions (37)

  • Definition 1
  • Definition 2
  • Lemma 1
  • Remark 1
  • Lemma 2
  • Theorem 1: Global convergence of Algorithm \ref{['alg1']}
  • Theorem 2: Iteration complexity of Algorithm \ref{['alg1']}
  • Lemma 3
  • Lemma 4
  • Remark 2
  • ...and 27 more