Efficient Penalty-Based Bilevel Methods: Improved Analysis, Novel Updates, and Flatness Condition
Liuyuan Jiang, Quan Xiao, Lisha Chen, Tianyi Chen
TL;DR
This work advances penalty-based BLO by decoupling upper- and lower-level variables through a reformulated outer objective $F_ abla(x)$, enabling constant-step-size gradient-like updates and improved outer-loop efficiency. It introduces ALT-PBGD, an alternating-update method that tightens convergence rates to $O( abla^{-1})$ outer iterations for uncoupled CC BLOs, and PBGD-Free, a fully single-loop, value-function-free algorithm that minimizes inner computation at the cost of a bias term. To address potential divergence of PBGD-Free, the authors propose the $(oldsymbol{ extdelta},oldsymbol{ extalpha})$-flatness condition, which relaxes Lipschitz assumptions and bounds the penalty-induced bias, yielding convergence guarantees to stationary points under mild conditions and for both uncoupled and coupled constraint BLOs. The theory is complemented by empirical results on SVM hyperparameter optimization and large-language-model PEFT, showing that the proposed methods achieve comparable or better accuracy with substantially reduced computational cost, highlighting practical impact for scalable BLO in constrained settings.
Abstract
Penalty-based methods have become popular for solving bilevel optimization (BLO) problems, thanks to their effective first-order nature. However, they often require inner-loop iterations to solve the lower-level (LL) problem and small outer-loop step sizes to handle the increased smoothness induced by large penalty terms, leading to suboptimal complexity. This work considers the general BLO problems with coupled constraints (CCs) and leverages a novel penalty reformulation that decouples the upper- and lower-level variables. This yields an improved analysis of the smoothness constant, enabling larger step sizes and reduced iteration complexity for Penalty-Based Gradient Descent algorithms in ALTernating fashion (ALT-PBGD). Building on the insight of reduced smoothness, we propose PBGD-Free, a novel fully single-loop algorithm that avoids inner loops for the uncoupled constraint BLO. For BLO with CCs, PBGD-Free employs an efficient inner-loop with substantially reduced iteration complexity. Furthermore, we propose a novel curvature condition describing the "flatness" of the upper-level objective with respect to the LL variable. This condition relaxes the traditional upper-level Lipschitz requirement, enables smaller penalty constant choices, and results in a negligible penalty gradient term during upper-level variable updates. We provide rigorous convergence analysis and validate the method's efficacy through hyperparameter optimization for support vector machines and fine-tuning of large language models.
