Table of Contents
Fetching ...

Semi-infinite Nonconvex Constrained Min-Max Optimization

Cody Melcher, Zeinab Alizadeh, Lindsey Hiett, Afrooz Jalilzadeh, Erfan Yazdandoost Hamedani

TL;DR

The paper tackles nonconvex min-max optimization with an infinite constraint set by formulating it as a semi-infinite problem. It introduces the inexact dynamic barrier primal-dual (iDB-PD) algorithm, which uses a QP subproblem, a feasibility-indicating indicator, and inner maximization approximations to update primal and dual variables, deriving global non-asymptotic convergence guarantees under a Łojasiewicz condition with exponent $\theta\in(0,1)$. The main theoretical contributions are explicit iteration complexities for achieving $\epsilon$-stationarity, $\epsilon$-feasibility, and $\epsilon$-complementarity: $\mathcal{O}(\epsilon^{-3})$, $\mathcal{O}(\epsilon^{-6\theta})$, and $\mathcal{O}(\epsilon^{-3\theta/(1-\theta)})$ respectively, with a special PL case $\theta=1/2$ giving $\mathcal{O}(\epsilon^{-3})$ for all. Empirically, iDB-PD demonstrates robust performance on robust multitask learning with task prioritization, outperforming adaptive discretization and DRO-based baselines, and highlighting its applicability to safety-critical ML and robust optimization under infinite constraints.

Abstract

Semi-Infinite Programming (SIP) has emerged as a powerful framework for modeling problems with infinite constraints, however, its theoretical development in the context of nonconvex and large-scale optimization remains limited. In this paper, we investigate a class of nonconvex min-max optimization problems with nonconvex infinite constraints, motivated by applications such as adversarial robustness and safety-constrained learning. We propose a novel inexact dynamic barrier primal-dual algorithm and establish its convergence properties. Specifically, under the assumption that the squared infeasibility residual function satisfies the Lojasiewicz inequality with exponent $θ\in (0,1)$, we prove that the proposed method achieves $\mathcal{O}(ε^{-3})$, $\mathcal{O}(ε^{-6θ})$, and $\mathcal{O}(ε^{-3θ/(1-θ)})$ iteration complexities to achieve an $ε$-approximate stationarity, infeasibility, and complementarity slackness, respectively. Numerical experiments on robust multitask learning with task priority further illustrate the practical effectiveness of the algorithm.

Semi-infinite Nonconvex Constrained Min-Max Optimization

TL;DR

The paper tackles nonconvex min-max optimization with an infinite constraint set by formulating it as a semi-infinite problem. It introduces the inexact dynamic barrier primal-dual (iDB-PD) algorithm, which uses a QP subproblem, a feasibility-indicating indicator, and inner maximization approximations to update primal and dual variables, deriving global non-asymptotic convergence guarantees under a Łojasiewicz condition with exponent . The main theoretical contributions are explicit iteration complexities for achieving -stationarity, -feasibility, and -complementarity: , , and respectively, with a special PL case giving for all. Empirically, iDB-PD demonstrates robust performance on robust multitask learning with task prioritization, outperforming adaptive discretization and DRO-based baselines, and highlighting its applicability to safety-critical ML and robust optimization under infinite constraints.

Abstract

Semi-Infinite Programming (SIP) has emerged as a powerful framework for modeling problems with infinite constraints, however, its theoretical development in the context of nonconvex and large-scale optimization remains limited. In this paper, we investigate a class of nonconvex min-max optimization problems with nonconvex infinite constraints, motivated by applications such as adversarial robustness and safety-constrained learning. We propose a novel inexact dynamic barrier primal-dual algorithm and establish its convergence properties. Specifically, under the assumption that the squared infeasibility residual function satisfies the Lojasiewicz inequality with exponent , we prove that the proposed method achieves , , and iteration complexities to achieve an -approximate stationarity, infeasibility, and complementarity slackness, respectively. Numerical experiments on robust multitask learning with task priority further illustrate the practical effectiveness of the algorithm.

Paper Structure

This paper contains 24 sections, 11 theorems, 42 equations, 4 figures, 1 table, 1 algorithm.

Key Result

Theorem 4.1

Suppose Assumptions assum:objective, assum:constraint, and assum:reg hold. Let $\{x_k,\lambda_k\}_{k\geq 0}$ be the sequence generated by Algorithm alg:sp_const such that $\{\alpha_k\}_k$ is a non-increasing sequence and $\gamma_k \leq (L_f+L_{xy}^\phi)^{-1}$. Then, for any $T\geq 1$, for some summable sequences $\{\mathcal{E}_k^y,\mathcal{E}_k^w\}_{k\geq 0}\subset \mathbb R_+$, where $\Gamma_T\t

Figures (4)

  • Figure 1: iDB-PD vs. Adaptive Discretization with COOPER on multi-MNIST (top row) and CHD49 (bottom row), evaluated in terms of stationarity, infeasibility, and slackness.
  • Figure 2: iDB--PD vs GDMA on multi-MNIST (top row) and CHD49 (bottom row), evaluated in terms of stationarity, infeasibility, and objective loss.
  • Figure 3: iDB-PD vs. GDMA on multi-Fashion MNIST (top row), Yeast (middle row), and 20NG (bottom row), evaluated in terms of stationarity, infeasibility, and objective loss.
  • Figure 4: iDB--PD vs. Adaptive Discretization with COOPER on multi-Fashion MNIST (top row), Yeast (middle row), and 20NG (bottom row), evaluated in terms of stationarity, infeasibility, and slackness.

Theorems & Definitions (23)

  • Definition 2.1: Λojasiewicz inequality
  • Remark 2.1
  • Remark 3.1
  • Theorem 4.1
  • proof
  • Theorem 4.2
  • proof
  • Remark 4.1
  • Proposition 6.1: karimi2016linearnesterov2018lecturesnecoara2019linear
  • Proposition 6.2: nouiehed2019solving Lemma A.5
  • ...and 13 more