Derivative-Free Bilevel Optimization with Inexact Lower-Level Solutions

Edoardo Cesaroni; Giampaolo Liuzzi; Stefano Lucidi

Derivative-Free Bilevel Optimization with Inexact Lower-Level Solutions

Edoardo Cesaroni, Giampaolo Liuzzi, Stefano Lucidi

Abstract

In this work, we propose derivative-free framework for bilevel optimization. We consider both the upper and lower-level problems with bound constraints on the variables, as well as general nonlinear constraints, assuming that first-order information (in the upper-level) is not available or it is impractical to obtain. The lower-level problem is solved with an accuracy that is progressively refined throughout the optimization process. We first analyze the case in which the upper-level problem is subject only to bound constraints, establishing convergence to Clarke-Jahn stationary points when the refinement process is allowed to reach its maximum precision. When a limitation is imposed on this refinement process, we prove convergence to approximate stationary points using an extended notion of Goldstein stationarity. Finally, we extend the proposed framework to handle more complex constraints via an exact penalty function approach, proving convergence to stationary points under suitable assumptions. A comprehensive numerical study on 160 problems from the BOLIB collection shows that the adaptive accuracy strategy consistently yields better results than fixed-precision solves, with its benefits becoming more pronounced as the required lower-level accuracy becomes more stringent.

Derivative-Free Bilevel Optimization with Inexact Lower-Level Solutions

Abstract

Paper Structure (17 sections, 13 theorems, 104 equations, 2 figures, 2 tables, 2 algorithms)

This paper contains 17 sections, 13 theorems, 104 equations, 2 figures, 2 tables, 2 algorithms.

Introduction
Contributions
Assumptions
Notations and definitions
Bound-Constrained Upper-Level Problem
A Linesearch Derivative-free Method for Bilevel Programming
Convergence analysis for the exact case ($\bar{\zeta} = 0$)
Convergence analysis for the inexact case ($\bar{\zeta} > 0$)
Handling of General Inequality Upper-Level Constraints
Numerical Results
Test problems collection
Performance evaluation methodology
Algorithms implementation details and parameter settings
Effect of multiple step sizes
Effect of Projected Extrapolation
...and 2 more sections

Key Result

Proposition 1

\newlabelprop:nocycle_projectedLS0 The Projected Extrapolation cannot cycle indefinitely between Step 11 and Step 17.

Figures (2)

Figure 1: Data profiles (top row) and performance profiles (bottom row) for $\bar{\zeta} = 10^{-6}$. Left half: MS-DFN-LLA (multiple step sizes) vs. DFN-LLA (single step size). Right half: MS-DFN-LLA (with extrapolation) vs. MS-DFN-NL-LLA (without). Odd columns: $\tau = 10^{-3}$; even columns: $\tau = 10^{-6}$.
Figure 2: Data profiles (columns 1--2) and performance profiles (columns 3--4) comparing MS-DFN-LLF (fixed accuracy) with MS-DFN-LLA (adaptive accuracy). Rows correspond to $\bar{\zeta} = 10^{-3}$ (top), $10^{-6}$ (middle), $10^{-9}$ (bottom). Odd columns: $\tau = 10^{-3}$; even columns: $\tau = 10^{-6}$.

Theorems & Definitions (22)

Remark 1
Remark 2
Definition 1: Clarke and Clarke-Jahn generalized directional derivative
Definition 2: Dense sequence
Definition 1: Cone of Feasible Directions
Definition 2: Clarke and Clarke-Jahn stationarity
Definition 3: $(\delta, \epsilon)$-Goldstein stationarity
Proposition 1
Proposition 2
Proposition 3
...and 12 more

Derivative-Free Bilevel Optimization with Inexact Lower-Level Solutions

Abstract

Derivative-Free Bilevel Optimization with Inexact Lower-Level Solutions

Authors

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (22)