A Single-Loop Gradient Algorithm for Pessimistic Bilevel Optimization via Smooth Approximation
Qichao Cao, Shangzhi Zeng, Jin Zhang
TL;DR
This work addresses the challenge of Pessimistic Bilevel Optimization (PBO) by introducing a smooth, differentiable surrogate $\phi_{\rho,\sigma}(x)$ via penalization and regularization, enabling fully first-order gradient methods. Building on this, the authors propose SiPBA, a single-loop gradient algorithm that uses a one-step ascent-descent to approximate the saddle point and an inexact gradient to update the upper-level variable, avoiding second-order derivatives and inner-loop subproblem solves. Theoretical results establish epi-convergence of the smooth approximation to the original PBO and non-asymptotic convergence rates for SiPBA, along with a practical parameter schedule. Empirical evaluations on synthetic problems, spam classification, and hyper-representation tasks demonstrate competitive performance and efficiency, highlighting SiPBA’s robustness to hyperparameters and problem scale. Overall, the paper advances scalable, robust optimization under worst-case follower responses with potential impact on adversarial learning and robust hyperparameter tuning.
Abstract
Bilevel optimization has garnered significant attention in the machine learning community recently, particularly regarding the development of efficient numerical methods. While substantial progress has been made in developing efficient algorithms for optimistic bilevel optimization, the study of methods for solving Pessimistic Bilevel Optimization (PBO) remains relatively less explored, especially the design of fully first-order, single-loop gradient-based algorithms. This paper aims to bridge this research gap. We first propose a novel smooth approximation to the PBO problem, using penalization and regularization techniques. Building upon this approximation, we then propose SiPBA (Single-loop Pessimistic Bilevel Algorithm), a new gradient-based method specifically designed for PBO which avoids second-order derivative information or inner-loop iterations for subproblem solving. We provide theoretical validation for the proposed smooth approximation scheme and establish theoretical convergence for the algorithm SiPBA. Numerical experiments on synthetic examples and practical applications demonstrate the effectiveness and efficiency of SiPBA.
