Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation

Xiaoning Bai; Shangzhi Zeng; Jin Zhang; Lezhi Zhang

Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation

Xiaoning Bai, Shangzhi Zeng, Jin Zhang, Lezhi Zhang

TL;DR

This work tackles bilevel optimization with a convex lower-level by introducing AGILS, an alternating gradient-type algorithm that leverages a Moreau envelope reformulation $(\mathrm{VP})_\gamma$ and inexact proximal lower-level solves. AGILS incorporates an adaptive penalty and a feasibility-correction step to maintain constraint satisfaction, and it provides convergence guarantees to KKT points of a relaxed problem $(\mathrm{VP})_\gamma^{\epsilon}$, with subsequential convergence under mild assumptions and full sequence convergence under a KL property. The method is validated on a toy problem and a sparse group Lasso hyperparameter selection task, showing superior efficiency and robustness to the inexactness of the lower-level solutions compared to several baselines. Overall, AGILS offers a flexible, scalable approach for high-dimensional bilevel problems where exact lower-level solves are impractical, delivering provable convergence and competitive empirical performance.

Abstract

In this paper, we study a class of bilevel optimization problems where the lower-level problem is a convex composite optimization model, which arises in various applications, including bilevel hyperparameter selection for regularized regression models. To solve these problems, we propose an Alternating Gradient-type algorithm with Inexact Lower-level Solutions (AGILS) based on a Moreau envelope-based reformulation of the bilevel optimization problem. The proposed algorithm does not require exact solutions of the lower-level problem at each iteration, improving computational efficiency. We prove the convergence of AGILS to stationary points and, under the Kurdyka-Łojasiewicz (KL) property, establish its sequential convergence. Numerical experiments, including a toy example and a bilevel hyperparameter selection problem for the sparse group Lasso model, demonstrate the effectiveness of the proposed AGILS.

Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation

TL;DR

This work tackles bilevel optimization with a convex lower-level by introducing AGILS, an alternating gradient-type algorithm that leverages a Moreau envelope reformulation

and inexact proximal lower-level solves. AGILS incorporates an adaptive penalty and a feasibility-correction step to maintain constraint satisfaction, and it provides convergence guarantees to KKT points of a relaxed problem

, with subsequential convergence under mild assumptions and full sequence convergence under a KL property. The method is validated on a toy problem and a sparse group Lasso hyperparameter selection task, showing superior efficiency and robustness to the inexactness of the lower-level solutions compared to several baselines. Overall, AGILS offers a flexible, scalable approach for high-dimensional bilevel problems where exact lower-level solves are impractical, delivering provable convergence and competitive empirical performance.

Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation

TL;DR

Abstract

Alternating Gradient-Type Algorithm for Bilevel Optimization with Inexact Lower-Level Solutions via Moreau Envelope-based Reformulation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (39)