Constrained Online Convex Optimization with Memory and Predictions

Mohammed Abdullah; George Iosifidis; Salah Eddine Elayoubi; Tijani Chahed

Constrained Online Convex Optimization with Memory and Predictions

Mohammed Abdullah, George Iosifidis, Salah Eddine Elayoubi, Tijani Chahed

Abstract

We study Constrained Online Convex Optimization with Memory (COCO-M), where both the loss and the constraints depend on a finite window of past decisions made by the learner. This setting extends the previously studied unconstrained online optimization with memory framework and captures practical problems such as the control of constrained dynamical systems and scheduling with reconfiguration budgets. For this problem, we propose the first algorithms that achieve sublinear regret and sublinear cumulative constraint violation under time-varying constraints, both with and without predictions of future loss and constraint functions. Without predictions, we introduce an adaptive penalty approach that guarantees sublinear regret and constraint violation. When short-horizon and potentially unreliable predictions are available, we reinterpret the problem as online learning with delayed feedback and design an optimistic algorithm whose performance improves as prediction accuracy improves, while remaining robust when predictions are inaccurate. Our results bridge the gap between classical constrained online convex optimization and memory-dependent settings, and provide a versatile learning toolbox with diverse applications.

Constrained Online Convex Optimization with Memory and Predictions

Abstract

Paper Structure (30 sections, 10 theorems, 112 equations, 4 figures, 1 table, 3 algorithms)

This paper contains 30 sections, 10 theorems, 112 equations, 4 figures, 1 table, 3 algorithms.

Introduction
Related work
COCO Bounds.
COCO-M and Non-stochastic Control.
Optimism.
Summary.
Preliminaries
Background.
Learning Model & Assumptions.
Algorithm Design & Analysis
Problem $\texttt{COCO-M}^2$: Double Memory Effect
Discussion.
Problem $\texttt{COCO-M}$
Discussion
Benefiting from Predictions
...and 15 more sections

Key Result

Lemma 1

Under Assumptions 1-3, performing OGD on eq:surrogate with step $\eta_t\!=\!\frac{\sqrt{2}\|\mathcal{X}\|}{2\sqrt{\sum_{\tau=1}^{t} \|\nabla \hat{\mathcal{L}}_t(x_t)\|^2}}$, we obtain: where the benchmark set depends on the problem: $\mathcal{X}_T^{\mathrm{b}}=\mathcal{X}_T^{m}$ for COCO-M$^2$ and $\mathcal{X}_T^{\mathrm{b}}=\mathcal{X}_T$ for COCO-M.

Figures (4)

Figure 1: decision stages of COCO-M (with predictions).
Figure 2: Diagonal: $Z_t(x_t)$ depends only on $x_t$, but includes loss/constraint components from next $m$ rounds. Vertical: $\mathcal{L}_t(x_{t-m}^t)$ includes function components only from round $t$, but depends on all past $m$ decisions.
Figure 3: Notation used throughout the paper
Figure 4: Performance of Algorithm \ref{['algo:memory-coco']} on $\texttt{COCO‑M}^{2}$ with $x^{\star}\!\in\!\mathcal{X}^{\text{m}}_T$.

Theorems & Definitions (18)

Remark 1
Lemma 1
Lemma 2: Regret decomposition sinha2024optimal
Theorem 1: Regret and CCV for $\texttt{COCO-M}^2$
Theorem 2: Regret and CCV with memory-free constraint
Lemma 3
Theorem 3
proof : Proof Theorem \ref{['thm:obj-memory']}
proof : Proof of Theorem \ref{['thm:full-memory']}
Theorem 4: Regret and CCV with memory-free constraint
...and 8 more

Constrained Online Convex Optimization with Memory and Predictions

Abstract

Constrained Online Convex Optimization with Memory and Predictions

Authors

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (18)