Table of Contents
Fetching ...

Optimal Primal-Dual Algorithm with Last iterate Convergence Guarantees for Stochastic Convex Optimization Problems

Digvijay Boob, Mohammad Khalafi

Abstract

This paper proposes a novel first-order algorithm that solves composite nonsmooth and stochastic convex optimization problem with function constraints. Most of the works in the literature provide convergence rate guarantees on the average-iterate solution. There is growing interest in the convergence guarantees of the last iterate solution due to its favorable structural properties, such as sparsity or privacy guarantees and good performance in practice. We provide the first method that obtains the best-known convergence rate guarantees on the last iterate for stochastic composite nonsmooth convex function-constrained optimization problems. Our novel and easy-to-implement algorithm is based on the augmented Lagrangian technique and uses a new linearized approximation of constraint functions, leading to its name, the Augmented Constraint Extrapolation (Aug-ConEx) method. We show that Aug-ConEx achieves $\mathcal{O}(1/\sqrt{K})$ convergence rate in the nonsmooth stochastic setting without any strong convexity assumption and $\mathcal{O}(1/K)$ for the same problem with strongly convex objective function. While optimal for nonsmooth and stochastic problems, the Aug-ConEx method also accelerates convergence in terms of Lipschitz smoothness constants to $\mathcal{O}(1/K)$ and $\mathcal{O}(1/K^2)$ in the aforementioned cases, respectively. To our best knowledge, this is the first method to obtain such differentiated convergence rate guarantees on the last iterate for a composite nonsmooth stochastic setting without additional $\log{K}$ factors. We validate the efficiency of our algorithm by comparing it with a state-of-the-art algorithm through numerical experiments.

Optimal Primal-Dual Algorithm with Last iterate Convergence Guarantees for Stochastic Convex Optimization Problems

Abstract

This paper proposes a novel first-order algorithm that solves composite nonsmooth and stochastic convex optimization problem with function constraints. Most of the works in the literature provide convergence rate guarantees on the average-iterate solution. There is growing interest in the convergence guarantees of the last iterate solution due to its favorable structural properties, such as sparsity or privacy guarantees and good performance in practice. We provide the first method that obtains the best-known convergence rate guarantees on the last iterate for stochastic composite nonsmooth convex function-constrained optimization problems. Our novel and easy-to-implement algorithm is based on the augmented Lagrangian technique and uses a new linearized approximation of constraint functions, leading to its name, the Augmented Constraint Extrapolation (Aug-ConEx) method. We show that Aug-ConEx achieves convergence rate in the nonsmooth stochastic setting without any strong convexity assumption and for the same problem with strongly convex objective function. While optimal for nonsmooth and stochastic problems, the Aug-ConEx method also accelerates convergence in terms of Lipschitz smoothness constants to and in the aforementioned cases, respectively. To our best knowledge, this is the first method to obtain such differentiated convergence rate guarantees on the last iterate for a composite nonsmooth stochastic setting without additional factors. We validate the efficiency of our algorithm by comparing it with a state-of-the-art algorithm through numerical experiments.

Paper Structure

This paper contains 18 sections, 10 theorems, 120 equations, 3 figures, 2 tables, 2 algorithms.

Key Result

Lemma 1

If $L_k\geq 2\rho_k(M_g + M_{\chi})^2$, then $F$ is a contraction operator satisfying satisfying $\gnorm{F(\mathbf{x}_1) - F(\mathbf{x}_2)}{}{} \le \tfrac{1}{2} \gnorm{\mathbf{x}_1-\mathbf{x}_2}{}{}$. Moreover, $x_{k+1}$ is the unique fixed point of $F$.

Figures (3)

  • Figure 1: Performance of $\text{Aug-ConEx}$ vs ConEx in terms of optimality and feasibility gaps on 10 i.i.d instances of sparse QCQP using the proximal operator in Algorithm \ref{['alg:implicit']}.
  • Figure 2: Performance of $\text{Aug-ConEx}$ vs ConEx in terms of optimality and feasibility gaps on 10 i.i.d instances of sparse QCQP using projection in Algorithm \ref{['alg:implicit']}.
  • Figure 3: Number of iterations in Algorithm \ref{['alg:implicit']} with $K=9000$ to converge to the fixed points of \ref{['eq:conceptual-x-s-update']} in Convex setting.

Theorems & Definitions (17)

  • Definition 1
  • Lemma 1
  • Theorem 1
  • Remark 1
  • Theorem 2
  • Corollary 1
  • proof
  • Theorem 3
  • Corollary 2
  • proof
  • ...and 7 more