A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints
Liuyuan Jiang, Quan Xiao, Victor M. Tenorio, Fernando Real-Rojas, Antonio G. Marques, Tianyi Chen
TL;DR
The paper tackles bilevel optimization with coupled lower-level constraints by introducing a primal-dual-assisted penalty reformulation, enabling a projection-free, fully first-order algorithm named BLOCC. BLOCC solves a smooth outer problem on $F_{\gamma}(x)$ while using an efficient max-min inner solver to compute hypergradients, achieving a nonasymptotic rate of $\tilde{\mathcal{O}}(\epsilon^{-2.5})$ and improving to $\tilde{\mathcal{O}}(\epsilon^{-1.5})$ when the CCs are affine in $y$. The authors establish differentiability and gradient expressions for the penalty objective, prove convergence under LICQ and RSI-type conditions, and present two enhanced results for the affine-CC special case with linear convergence. Empirical results on SVM hyperparameter tuning and large-scale transportation network design demonstrate BLOCC’s scalability and competitiveness against state-of-the-art baselines, highlighting its practical impact for constrained BLO in machine learning and operations research. Overall, BLOCC offers a robust, first-order, projection-free framework for complex BLOs with coupled constraints, enabling efficient solutions to high-dimensional problems with real-world data.
Abstract
Interest in bilevel optimization has grown in recent years, partially due to its applications to tackle challenging machine-learning problems. Several exciting recent works have been centered around developing efficient gradient-based algorithms that can solve bilevel optimization problems with provable guarantees. However, the existing literature mainly focuses on bilevel problems either without constraints, or featuring only simple constraints that do not couple variables across the upper and lower levels, excluding a range of complex applications. Our paper studies this challenging but less explored scenario and develops a (fully) first-order algorithm, which we term BLOCC, to tackle BiLevel Optimization problems with Coupled Constraints. We establish rigorous convergence theory for the proposed algorithm and demonstrate its effectiveness on two well-known real-world applications - hyperparameter selection in support vector machine (SVM) and infrastructure planning in transportation networks using the real data from the city of Seville.
