First-Order Methods for Nonsmooth Nonconvex Functional Constrained Optimization with or without Slater Points
Zhichao Jia, Benjamin Grimmer
TL;DR
This work addresses constrained optimization with nonsmooth, nonconvex objectives and constraints on a closed convex set $X$, proposing a simple inexact proximal point method. The method uses an inner switching subgradient loop to solve proximal subproblems, guaranteeing feasibility and achieving $O(1/ε^4)$ subgradient evaluations to reach an $ε$-stationary point. It yields approximate Fritz–John points without constraint qualification and approximate KKT points under CQ (via a $ ext{σ}$-strong MFCQ bound on multipliers), without requiring compactness of the feasible set. The approach is demonstrated on sparsity-inducing SCAD constraints, showing realistic behavior under CQ failures and active/inactive constraint regimes, with potential extensions to stochastic settings and structure-aware acceleration. Overall, the paper advances first-order methods for nonsmooth nonconvex constrained optimization by guaranteeing feasibility and stationarity in broad settings, including when classical constraint qualifications fail.
Abstract
Constrained optimization problems where both the objective and constraints may be nonsmooth and nonconvex arise across many learning and data science settings. In this paper, we show for any Lipschitz, weakly convex objectives and constraints, a simple first-order method finds a feasible, $ε$-stationary point at a convergence rate of $O(ε^{-4})$ without relying on compactness or Constraint Qualification (CQ). When CQ holds, this convergence is measured by approximately satisfying the Karush-Kuhn-Tucker conditions. When CQ fails, we guarantee the attainment of weaker Fritz-John conditions. As an illustrative example, our method stably converges on piecewise quadratic SCAD regularized problems despite frequent violations of constraint qualification. The considered algorithm is similar to those of "Quadratically regularized subgradient methods for weakly convex optimization with weakly convex constraints" by Ma et al. and "Stochastic first-order methods for convex and nonconvex functional constrained optimization" by Boob et al. (whose guarantees further assume compactness and CQ), iteratively taking inexact proximal steps, computed via an inner loop applying a switching subgradient method to a strongly convex constrained subproblem. Our non-Lipschitz analysis of the switching subgradient method appears to be new and may be of independent interest.
