Revisiting Projection-Free Online Learning with Time-Varying Constraints

Yibo Wang; Yuanyu Wan; Lijun Zhang

Revisiting Projection-Free Online Learning with Time-Varying Constraints

Yibo Wang, Yuanyu Wan, Lijun Zhang

TL;DR

This work addresses projection-free constrained online convex optimization with time-varying constraints by constructing a Lyapunov-based composite surrogate loss and applying a parameter-free online Frank-Wolfe (OFW) extension. It delivers improved regret and cumulative constraint violation bounds for both general convex and strongly convex losses, and extends the framework to bandit feedback using one-point gradient estimators with comparable rates. The proposed OFW-TVC and SCOFW-TVC algorithms, together with their bandit counterparts, achieve $\mathrm{Regret}_T=\mathcal{O}(T^{3/4})$ and $Q_T=\mathcal{O}(T^{3/4}\log T)$ in the general convex full-information setting, and $\mathrm{Regret}_T=\mathcal{O}(T^{2/3})$ and $Q_T=\mathcal{O}(T^{5/6})$ in the strongly convex regime, with corresponding bandit results including $\mathcal{O}(T^{3/4})$ and $\mathcal{O}(T^{2/3}\log T)$ terms. Experiments on real datasets like MovieLens and FilmTrust validate the theoretical gains, demonstrating the practicality of projection-free COCO under time-varying constraints. The work advances projection-free online learning by eliminating projection costs while rigorously balancing regret and long-term constraint satisfaction.

Abstract

We investigate constrained online convex optimization, in which decisions must belong to a fixed and typically complicated domain, and are required to approximately satisfy additional time-varying constraints over the long term. In this setting, the commonly used projection operations are often computationally expensive or even intractable. To avoid the time-consuming operation, several projection-free methods have been proposed with an $\mathcal{O}(T^{3/4} \sqrt{\log T})$ regret bound and an $\mathcal{O}(T^{7/8})$ cumulative constraint violation (CCV) bound for general convex losses. In this paper, we improve this result and further establish \textit{novel} regret and CCV bounds when loss functions are strongly convex. The primary idea is to first construct a composite surrogate loss, involving the original loss and constraint functions, by utilizing the Lyapunov-based technique. Then, we propose a parameter-free variant of the classical projection-free method, namely online Frank-Wolfe (OFW), and run this new extension over the online-generated surrogate loss. Theoretically, for general convex losses, we achieve an $\mathcal{O}(T^{3/4})$ regret bound and an $\mathcal{O}(T^{3/4} \log T)$ CCV bound, both of which are order-wise tighter than existing results. For strongly convex losses, we establish new guarantees of an $\mathcal{O}(T^{2/3})$ regret bound and an $\mathcal{O}(T^{5/6})$ CCV bound. Moreover, we also extend our methods to a more challenging setting with bandit feedback, obtaining similar theoretical findings. Empirically, experiments on real-world datasets have demonstrated the effectiveness of our methods.

Revisiting Projection-Free Online Learning with Time-Varying Constraints

TL;DR

and

in the general convex full-information setting, and

and

in the strongly convex regime, with corresponding bandit results including

and

terms. Experiments on real datasets like MovieLens and FilmTrust validate the theoretical gains, demonstrating the practicality of projection-free COCO under time-varying constraints. The work advances projection-free online learning by eliminating projection costs while rigorously balancing regret and long-term constraint satisfaction.

Abstract

regret bound and an

cumulative constraint violation (CCV) bound for general convex losses. In this paper, we improve this result and further establish \textit{novel} regret and CCV bounds when loss functions are strongly convex. The primary idea is to first construct a composite surrogate loss, involving the original loss and constraint functions, by utilizing the Lyapunov-based technique. Then, we propose a parameter-free variant of the classical projection-free method, namely online Frank-Wolfe (OFW), and run this new extension over the online-generated surrogate loss. Theoretically, for general convex losses, we achieve an

regret bound and an

CCV bound, both of which are order-wise tighter than existing results. For strongly convex losses, we establish new guarantees of an

regret bound and an

CCV bound. Moreover, we also extend our methods to a more challenging setting with bandit feedback, obtaining similar theoretical findings. Empirically, experiments on real-world datasets have demonstrated the effectiveness of our methods.

Revisiting Projection-Free Online Learning with Time-Varying Constraints

TL;DR

Abstract

Revisiting Projection-Free Online Learning with Time-Varying Constraints

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (16)