Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

Shanqi Liu; Xin Liu

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

Shanqi Liu, Xin Liu

TL;DR

This work tackles online convex optimization with unknown linear budget constraints and partial feedback, formalizing a budgeted loss minimization problem where only gradient information and budget usage are observed. The authors introduce SELO, a Lyapunov-optimization–based primal–dual algorithm that uses gradient-based loss estimation, pessimistic budget estimates, and virtual queues to balance regret and budget adherence; its surrogate objective is $V\hat{f}_t(\mathbf x) + \langle \mathbf Q_t, \hat{g}_t(\mathbf x) \rangle + \frac{1}{2\eta}\|\mathbf x - \mathbf x_{t-1}\|^2$. They prove that SELO achieves $\tilde{O}(\sqrt{T})$ regret with $O(1)$ constraint violation under soft budgets and $\tilde{O}(\sqrt{T})$ regret with no violation under hard budgets, via a multi-step Lyapunov-drift analysis and a key regret+drift lemma. The approach is computationally efficient (nearly unconstrained primal update) and is validated on energy-aware task processing in distributed data centers, where SELO outperforms baselines while maintaining safety, highlighting its practical impact for safe, efficient online decision-making with partial feedback.

Abstract

This paper studies online convex optimization with unknown linear budget constraints, where only the gradient information of the objective and the bandit feedback of constraint functions are observed. We propose a safe and efficient Lyapunov-optimization algorithm (SELO) that can achieve an $O(\sqrt{T})$ regret and zero cumulative constraint violation. The result also implies SELO achieves $O(\sqrt{T})$ regret when the budget is hard and not allowed to be violated. The proposed algorithm is computationally efficient as it resembles a primal-dual algorithm where the primal problem is an unconstrained, strongly convex and smooth problem, and the dual problem has a simple gradient-type update. The algorithm and theory are further justified in a simulated application of energy-efficient task processing in distributed data centers.

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

TL;DR

. They prove that SELO achieves

regret with

constraint violation under soft budgets and

regret with no violation under hard budgets, via a multi-step Lyapunov-drift analysis and a key regret+drift lemma. The approach is computationally efficient (nearly unconstrained primal update) and is validated on energy-aware task processing in distributed data centers, where SELO outperforms baselines while maintaining safety, highlighting its practical impact for safe, efficient online decision-making with partial feedback.

Abstract

regret and zero cumulative constraint violation. The result also implies SELO achieves

regret when the budget is hard and not allowed to be violated. The proposed algorithm is computationally efficient as it resembles a primal-dual algorithm where the primal problem is an unconstrained, strongly convex and smooth problem, and the dual problem has a simple gradient-type update. The algorithm and theory are further justified in a simulated application of energy-efficient task processing in distributed data centers.

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

TL;DR

Abstract

Safe and Efficient Online Convex Optimization with Linear Budget Constraints and Partial Feedback

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (4)