Online Bilevel Optimization: Regret Analysis of Online Alternating Gradient Methods
Davoud Ataee Tarzanagh, Parvin Nazari, Bojian Hou, Li Shen, Laura Balzano
TL;DR
This work introduces online bilevel optimization (OBO) and formalizes bilevel regret notions, including dynamic and static variants, with outer/inner path-length regularities to capture nonstationarity. It proposes Online Alternating Gradient Descent (OAGD) that uses a time-averaged hypergradient to update the outer variable while performing inner updates, achieving regret bounds that scale with $S_{p,T}=P_{p,T}+Y_{p,T}$. The authors establish strong theoretical results across strongly convex, convex, and non-convex settings, including lower bounds and a bilevel local regret bound for non-convex outer losses. They validate OBO experimentally on online hyperparameter learning for dynamic regression, online parametric loss tuning for imbalanced data, and online meta-learning, demonstrating competitive performance and favorable runtimes relative to baselines.
Abstract
This paper introduces \textit{online bilevel optimization} in which a sequence of time-varying bilevel problems is revealed one after the other. We extend the known regret bounds for online single-level algorithms to the bilevel setting. Specifically, we provide new notions of \textit{bilevel regret}, develop an online alternating time-averaged gradient method that is capable of leveraging smoothness, and give regret bounds in terms of the path-length of the inner and outer minimizer sequences.
