$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

Rahul Vaze; Abhishek Sinha

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

Rahul Vaze, Abhishek Sinha

TL;DR

This work advances constrained online convex optimization (COCO) by exploiting the geometry of nested constraint sets $S_t$ to control cumulative constraint violation (CCV) without sacrificing regret. It introduces a simple online gradient method, $\mathrm{coco\_alg\_1}$, that interleaves a gradient step with projections onto $S_{t-1}$ and $S_t$, achieving regret $O(\sqrt{T})$ and an instance-dependent CCV bound $O\big(\sqrt{d}\,(1/c^*)^{d}D\big)$, where $c^*$ captures the geometric width of the constraint set intersection. A Switch mechanism combines this with a universal COCO method from Sinha2024 to guarantee CCV no larger than $\min\{\mathcal{V}, O(\sqrt{T}\log T)\}$, providing best-of-both-worlds performance. For favorable geometries (e.g., spheres or axis-aligned $S_t$) and especially in 2D with monotone projection angles, CCV is tightly $O(1)$, and in the OCS special case, CCV improves to $O(1)$. These results substantially sharpen prior universal CCV bounds and highlight the role of constraint geometry in COCO performance.

Abstract

The constrained version of the standard online convex optimization (OCO) framework, called COCO is considered, where on every round, a convex cost function and a convex constraint function are revealed to the learner after it chooses the action for that round. The objective is to simultaneously minimize the static regret and cumulative constraint violation (CCV). An algorithm is proposed that guarantees a static regret of $O(\sqrt{T})$ and a CCV of $\min\{\cV, O(\sqrt{T}\log T) \}$, where $\cV$ depends on the distance between the consecutively revealed constraint sets, the shape of constraint sets, dimension of action space and the diameter of the action space. For special cases of constraint sets, $\cV=O(1)$. Compared to the state of the art results, static regret of $O(\sqrt{T})$ and CCV of $O(\sqrt{T}\log T)$, that were universal, the new result on CCV is instance dependent, which is derived by exploiting the geometric properties of the constraint sets.

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

TL;DR

This work advances constrained online convex optimization (COCO) by exploiting the geometry of nested constraint sets

to control cumulative constraint violation (CCV) without sacrificing regret. It introduces a simple online gradient method,

, that interleaves a gradient step with projections onto

and

, achieving regret

and an instance-dependent CCV bound

, where

captures the geometric width of the constraint set intersection. A Switch mechanism combines this with a universal COCO method from Sinha2024 to guarantee CCV no larger than

, providing best-of-both-worlds performance. For favorable geometries (e.g., spheres or axis-aligned

) and especially in 2D with monotone projection angles, CCV is tightly

, and in the OCS special case, CCV improves to

. These results substantially sharpen prior universal CCV bounds and highlight the role of constraint geometry in COCO performance.

Abstract

and a CCV of

, where

depends on the distance between the consecutively revealed constraint sets, the shape of constraint sets, dimension of action space and the diameter of the action space. For special cases of constraint sets,

. Compared to the state of the art results, static regret of

and CCV of

, that were universal, the new result on CCV is instance dependent, which is derived by exploiting the geometric properties of the constraint sets.

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

TL;DR

Abstract

$O(\sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (38)