Achieving Better Local Regret Bound for Online Non-Convex Bilevel Optimization
Tingkai Jia, Haiguang Wang, Cheng Chen
TL;DR
This work tackles online non-convex bilevel optimization with time-varying objectives by developing hypergradient-based online algorithms. It introduces AOBO with adaptive inner iterations to achieve the optimal standard regret Reg(T) = $O(1+V_T)$ and an efficient fully single-loop variant FSOBO with Reg(T) = $O(1+V_T+H_{2,T}+E_{2,T})$, while also addressing windowed non-stationarity through WOBO, which attains Reg_w(T) = $O(T/W^2)$ with a matching lower bound $Ω(T/W^2)$. The paper provides sharp upper and lower bounds, including a lower bound demonstrating optimality, and validates the methods with online data cleaning and loss-tuning experiments. By removing reliance on sublinear environmental variation assumptions for windowed regret and reducing inner-loop costs, the work significantly advances provable efficiency in dynamic OBO settings. The results offer both theoretically tight guarantees and practical algorithms for dynamic bilevel problems in machine learning applications.
Abstract
Online bilevel optimization (OBO) has emerged as a powerful framework for many machine learning problems. Prior works have developed several algorithms that minimize the standard bilevel local regret or the window-averaged bilevel local regret of the OBO problem, but the optimality of existing regret bounds remains unclear. In this work, we establish optimal regret bounds for both settings. For standard bilevel local regret, we propose an algorithm that achieves the optimal regret $Ω(1+V_T)$ with at most $O(T\log T)$ total inner-level gradient evaluations. We further develop a fully single-loop algorithm whose regret bound includes an additional gradient-variation terms. For the window-averaged bilevel local regret, we design an algorithm that captures sublinear environmental variation through a window-based analysis and achieves the optimal regret $Ω(T/W^2)$. Experiments validate our theoretical findings and demonstrate the practical effectiveness of the proposed methods.
