Table of Contents
Fetching ...

Optimistic Noise-Aware Sequential Quadratic Programming for Equality Constrained Optimization with Rank-Deficient Jacobians

Albert S. Berahas, Jiahao Shi, Baoyu Zhou

TL;DR

The paper addresses noisy equality-constrained optimization where both the objective and constraints, as well as their derivatives, are contaminated by bounded noise and the constraint Jacobian may be rank-deficient. It develops an optimistic, noise-aware SQP framework with a Byrd-Omojokun-style step decomposition into normal and tangential components, two adaptable step-size schemes, and an early stopping rule based on optimistic feasibility. Theoretical results establish neighborhood convergence to a first-order stationary point in the full-rank LICQ setting and convergence to a neighborhood of an infeasible stationary point when rank-deficiency occurs, with nonasymptotic bounds and iteration- complexity guarantees. Numerical experiments on CUTEst show robustness to noise and competitive performance against existing noise-aware SQP methods, including effective handling of rank-deficient scenarios and early termination benefits. Overall, the approach provides practical, theoretically grounded guidance for solving noisy equality-constrained problems in the presence of rank deficiencies and measurement noise.

Abstract

We propose and analyze a sequential quadratic programming algorithm for minimizing a noisy nonlinear smooth function subject to noisy nonlinear smooth equality constraints. The algorithm uses a step decomposition strategy and, as a result, is robust to potential rank-deficiency in the constraints, allows for two different step size strategies, and has an early stopping mechanism. Under the linear independence constraint qualification, convergence is established to a neighborhood of a first-order stationary point, where the radius of the neighborhood is proportional to the noise levels in the objective function and constraints. Moreover, in the rank-deficient setting, the merit parameter may converge to zero, and convergence to a neighborhood of an infeasible stationary point is established. Numerical experiments demonstrate the efficiency and robustness of the proposed method.

Optimistic Noise-Aware Sequential Quadratic Programming for Equality Constrained Optimization with Rank-Deficient Jacobians

TL;DR

The paper addresses noisy equality-constrained optimization where both the objective and constraints, as well as their derivatives, are contaminated by bounded noise and the constraint Jacobian may be rank-deficient. It develops an optimistic, noise-aware SQP framework with a Byrd-Omojokun-style step decomposition into normal and tangential components, two adaptable step-size schemes, and an early stopping rule based on optimistic feasibility. Theoretical results establish neighborhood convergence to a first-order stationary point in the full-rank LICQ setting and convergence to a neighborhood of an infeasible stationary point when rank-deficiency occurs, with nonasymptotic bounds and iteration- complexity guarantees. Numerical experiments on CUTEst show robustness to noise and competitive performance against existing noise-aware SQP methods, including effective handling of rank-deficient scenarios and early termination benefits. Overall, the approach provides practical, theoretically grounded guidance for solving noisy equality-constrained problems in the presence of rank deficiencies and measurement noise.

Abstract

We propose and analyze a sequential quadratic programming algorithm for minimizing a noisy nonlinear smooth function subject to noisy nonlinear smooth equality constraints. The algorithm uses a step decomposition strategy and, as a result, is robust to potential rank-deficiency in the constraints, allows for two different step size strategies, and has an early stopping mechanism. Under the linear independence constraint qualification, convergence is established to a neighborhood of a first-order stationary point, where the radius of the neighborhood is proportional to the noise levels in the objective function and constraints. Moreover, in the rank-deficient setting, the merit parameter may converge to zero, and convergence to a neighborhood of an infeasible stationary point is established. Numerical experiments demonstrate the efficiency and robustness of the proposed method.

Paper Structure

This paper contains 14 sections, 24 theorems, 185 equations, 8 figures, 4 tables, 1 algorithm.

Key Result

Lemma 3.6

Suppose that Assumptions ass:prob and ass.noearlytermination hold. If $\|\bar{c}_k\|> \epsilon_o$, there exist $\sigma_v \in \mathbb{R}_{>0}$ and $\underline{\sigma}_{Jc} \in \mathbb{R}_{>0}$ such that

Figures (8)

  • Figure 1: Box plots illustrating feasibility errors (left) and stationarity errors (right) for exact and inexact AdaSQP-opt and LSSQP-opt methods on CUTEst problems that satisfy the LICQ. First row: $\epsilon_c = 10^{-4}$ and $\epsilon_f \in \{10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$. Second row: $\epsilon_f = 10^{-4}$ and $\epsilon_c \in \{10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$.
  • Figure 2: Box plots illustrating feasibility errors (left), stationarity errors (middle) and infeasible stationarity error (right) for exact and inexact AdaSQP-opt and LSSQP-opt methods on CUTEst problems that violate the LICQ. First row: $\epsilon_c = 10^{-4}$ and $\epsilon_f \in \{10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$. Second row: $\epsilon_f = 10^{-4}$ and $\epsilon_c \in \{10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$.
  • Figure 3: Dolan-Moré performance profiles comparing AdaSQP-pes, AdaSQP-opt, LSSQP-pes, LSSQP-opt and NTSQP on CUTEst collection of test problems that satisfy the LICQ in terms of number of function evaluations for $\epsilon_c \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from left to right) and $\epsilon_f \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from top to bottom).
  • Figure 4: Dolan-Moré performance profiles comparing AdaSQP-pes, AdaSQP-opt, LSSQP-pes, LSSQP-opt and NTSQP on CUTEst collection of test problems that satisfy the LICQ in terms of MINRES iterations for $\epsilon_c \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from left to right) and $\epsilon_f \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from top to bottom).
  • Figure 5: Dolan-Moré performance profiles comparing AdaSQP-pes, AdaSQP-opt, LSSQP-pes, and LSSQP-opt on CUTEst collection of test problems in the absence of the LICQ in terms of function evaluations for $\epsilon_c \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from left to right) and $\epsilon_f \in \{ 10^{-1}, 10^{-2}, 10^{-4}, 10^{-8}\}$ (from top to bottom).
  • ...and 3 more figures

Theorems & Definitions (64)

  • Remark 2.3
  • Remark 2.5
  • Remark 2.6
  • Remark 3.3
  • Remark 3.5
  • Lemma 3.6
  • proof
  • Lemma 3.7
  • proof
  • Lemma 3.8
  • ...and 54 more