Online Non-Stationary Stochastic Quasar-Convex Optimization

Yuen-Man Pun; Iman Shames

Online Non-Stationary Stochastic Quasar-Convex Optimization

Yuen-Man Pun, Iman Shames

TL;DR

This paper develops a unified online optimization framework for time-varying losses that satisfy quasar-convexity and strong quasar-convexity, deriving dynamic regret bounds for online gradient descent in terms of path variation and gradient noise. It extends the theory to generalized linear models with time-varying parameters and analyzes activation functions including Leaky ReLU, logistic, and ReLU, providing activation-specific constants and step-size guidelines. The results yield sublinear regret under sublinear variation and noise growth, and are complemented by simulations demonstrating sublinear regret growth even with noise. The findings offer a principled approach for adapting non-convex unimodal objectives in online settings and have implications for real-time neural-model-like learning and dynamic system identification.

Abstract

Recent research has shown that quasar-convexity can be found in applications such as identification of linear dynamical systems and generalized linear models. Such observations have in turn spurred exciting developments in design and analysis algorithms that exploit quasar-convexity. In this work, we study the online stochastic quasar-convex optimization problems in a dynamic environment. We establish regret bounds of online gradient descent in terms of cumulative path variation and cumulative gradient variance for losses satisfying quasar-convexity and strong quasar-convexity. We then apply the results to generalized linear models (GLM) when the underlying parameter is time-varying. We establish regret bounds of online gradient descent when applying to GLMs with leaky ReLU activation function, logistic activation function, and ReLU activation function. Numerical results are presented to corroborate our findings.

Online Non-Stationary Stochastic Quasar-Convex Optimization

TL;DR

Abstract

Paper Structure (15 sections, 15 theorems, 94 equations, 3 figures, 1 table)

This paper contains 15 sections, 15 theorems, 94 equations, 3 figures, 1 table.

Introduction
Problem Statement and Preliminaries
Online Quasar-Convex Optimization
Online Strongly Quasar-Convex Optimization
Applications
Leaky ReLU
Logistic Function
Single ReLU
Simulations
Conclusion
Proofs for Theorem \ref{['thm:str-qua-cvx']}
Proofs for Section \ref{['sec:GLM']}
Proofs for Section \ref{['subsec:LeakyReLU']}
Proof for Section \ref{['subsec:logistic']}
Proofs for Section \ref{['subsec:relu']}

Key Result

Proposition 1

Let $f\colon\mathbb{R}^n\to\mathbb{R}$ be a differentiable function. If $f$ is $L$-smooth; i.e., for all $\bm{w},\bm{w}'\in\mathbb{R}^n$, then $f$ is $\Gamma$-weakly smooth where $\Gamma = 2L$.

Figures (3)

Figure 1: Leaky ReLU
Figure 2: Logistic
Figure 3: ReLU

Theorems & Definitions (26)

Definition 1: Quasar-Convexity HSS20
Definition 2: Weak Smoothness HMR18
Proposition 1: Smoothness Implies Weak Smoothness
proof
Theorem 1: Regret Bounds for Quasar-Convex Losses
Remark 1: Boundedness of Sets of Interest
Remark 2: Step Size Selection for Quasar-Convex Losses
proof
Proposition 2: Convergence of Offline Gradient Descent
Remark 3: Step Size Selection for Strongly Quasar-Convex Losses
...and 16 more

Online Non-Stationary Stochastic Quasar-Convex Optimization

TL;DR

Abstract

Online Non-Stationary Stochastic Quasar-Convex Optimization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (26)