Stochastic Gradient Descent for Constrained Optimization based on Adaptive Relaxed Barrier Functions
Naum Dimitrieski, Jing Cao, Christian Ebenbauer
TL;DR
This work proposes a stochastic gradient method for constrained optimization that uses an adaptive relaxed barrier function to handle inequality constraints with partial gradient information. By sampling both objective components and constraints and updating via $x_{k+1}=x_k-\gamma_k(\nabla f_{i_k}(x_k)+\nabla B(g_{j_k}(x_k),\delta_k))$, the authors prove almost-sure convergence to the central-path limit $x^{*}(\delta_\infty)$ under strong convexity of $f$ and affine constraints, given appropriate step-size and barrier-decay schemes. The relaxed barrier $B(z,\delta)$ enables avoidance of projections and maintains constant memory regardless of the number of constraints, with simulations showing favorable performance for problems with a very large number of constraints. The results provide a first theoretical and numerical demonstration of adaptive relaxed barrier SGD as a scalable approach to constrained optimization in large-scale settings. The approach has potential impact in deep learning and parallel computing contexts where constraint sets are enormous and full constraint information is impractical to process each iteration.
Abstract
This paper presents a novel stochastic gradient descent algorithm for constrained optimization. The proposed algorithm randomly samples constraints and components of the finite sum objective function and relies on a relaxed logarithmic barrier function that is appropriately adapted in each optimization iteration. For a strongly convex objective function and affine inequality constraints, step-size rules and barrier adaptation rules are established that guarantee asymptotic convergence with probability one. The theoretical results in the paper are complemented by numerical studies which highlight potential advantages of the proposed algorithm for optimization problems with a large number of constraints.
