Optimal Restart Strategies for Parameter-dependent Optimization Algorithms

Lisa Schönenberger; Hans-Georg Beyer

Optimal Restart Strategies for Parameter-dependent Optimization Algorithms

Lisa Schönenberger, Hans-Georg Beyer

TL;DR

This work examines restart strategies for parameter-dependent optimization algorithms where success requires surpassing an unknown threshold $\hat{\lambda}$. By classifying strategies into additive, multiplicative, and power-law types and introducing a loss function, the authors derive bounds and show that only multiplicative strategies yield a bounded relative loss in the asymptotic regime. They establish that the asymptotically optimal multiplicative factor is $\hat{\rho}=2$, with the $\mathcal{R}^\times$ and $\mathcal{R}^*$ variants achieving bounded performance and, for $\mathcal{R}^\times$, a worst-case relative loss of 3. Conversely, additive $\mathcal{R}^+$ and power-law $\mathcal{R}^\#$ strategies are strictly unbounded in relative loss. The results provide a principled, parameter-independent guideline favoring multiplicative restarts for robust, near-optimal performance in the face of unknown $\hat{\lambda}$.

Abstract

This paper examines restart strategies for algorithms whose successful termination depends on an unknown parameter $λ$. After each restart, $λ$ is increased, until the algorithm terminates successfully. It is assumed that there is a unique, unknown, optimal value for $λ$. For the algorithm to run successfully, this value must be reached or surpassed. The key question is whether there exists an optimal strategy for selecting $λ$ after each restart taking into account that the computational costs (runtime) increases with $λ$. In this work, potential restart strategies are classified into parameter-dependent strategy types. A loss function is introduced to quantify the wasted computational cost relative to the optimal strategy. A crucial requirement for any efficient restart strategy is that its loss, relative to the optimal $λ$, remains bounded. To this end, upper and lower bounds of the loss are derived. Using these bounds it will be shown that not all strategy types are bounded. However, for a particular strategy type, where $λ$ is increased multiplicatively by a constant factor $λ$, the relative loss function is bounded. Furthermore, it will be demonstrated that within this strategy type, there exists an optimal value for $λ$ that minimizes the maximum relative loss. In the asymptotic limit, this optimal choice of $λ$ does not depend on the unknown optimal $λ$.

Optimal Restart Strategies for Parameter-dependent Optimization Algorithms

TL;DR

This work examines restart strategies for parameter-dependent optimization algorithms where success requires surpassing an unknown threshold

. By classifying strategies into additive, multiplicative, and power-law types and introducing a loss function, the authors derive bounds and show that only multiplicative strategies yield a bounded relative loss in the asymptotic regime. They establish that the asymptotically optimal multiplicative factor is

, with the

and

variants achieving bounded performance and, for

, a worst-case relative loss of 3. Conversely, additive

and power-law

strategies are strictly unbounded in relative loss. The results provide a principled, parameter-independent guideline favoring multiplicative restarts for robust, near-optimal performance in the face of unknown

Abstract

This paper examines restart strategies for algorithms whose successful termination depends on an unknown parameter

. After each restart,

is increased, until the algorithm terminates successfully. It is assumed that there is a unique, unknown, optimal value for

. For the algorithm to run successfully, this value must be reached or surpassed. The key question is whether there exists an optimal strategy for selecting

after each restart taking into account that the computational costs (runtime) increases with

. In this work, potential restart strategies are classified into parameter-dependent strategy types. A loss function is introduced to quantify the wasted computational cost relative to the optimal strategy. A crucial requirement for any efficient restart strategy is that its loss, relative to the optimal

, remains bounded. To this end, upper and lower bounds of the loss are derived. Using these bounds it will be shown that not all strategy types are bounded. However, for a particular strategy type, where

is increased multiplicatively by a constant factor

, the relative loss function is bounded. Furthermore, it will be demonstrated that within this strategy type, there exists an optimal value for

that minimizes the maximum relative loss. In the asymptotic limit, this optimal choice of

does not depend on the unknown optimal

Paper Structure (12 sections, 14 theorems, 86 equations, 9 figures, 1 algorithm)

This paper contains 12 sections, 14 theorems, 86 equations, 9 figures, 1 algorithm.

Introduction
Restart Strategies
The Loss Function
The Loss Function for the $\mathcal{R}^+$-RS
The Loss Function for the $\mathcal{R}^{\times}$-RS
The Loss Function for the $\mathcal{R}^*$-RS
The Loss Function for the $\mathcal{R}^{\#}$-RS
The Relative Loss Function
Relative Loss Function and Optimal $\nu$ of the $\mathcal{R}^+$-RS
Relative Loss and Optimal $\rho$ of the Multiplicative Strategy Types
Relative Loss of the $\mathcal{R}^\#$-RS
Conclusion and Outlook

Key Result

Theorem 1

Let $\mathcal{L}^+(\hat{\lambda}, \nu)$ be the loss function (eq:lossSimple) of the $\mathcal{R}^+$-RS with restart parameter $\nu \in \mathbb{N} \setminus \{0\}$ and $\lambda_k = \lambda_0 + k\nu$. Let Then $\mathcal{L}^+_\mathrm{up}(\hat{\lambda}, \nu)$ is an upper bound of $\mathcal{L}^+(\hat{\lambda}, \nu)$.

Figures (9)

Figure 1: Loss function (\ref{['eq:lossSimple']}) for $\mathcal{R}^+$ depending on $\hat{\lambda}$. Gray markers represent the numerical values of the loss function, determined with Alg. \ref{['alg:lossNum']}. The green dashed line shows the upper bound (\ref{['eq:lossPUp']}) and the blue dashed line shows the lower bound (\ref{['eq:lossPLow']}).
Figure 2: Loss function (\ref{['eq:lossSimple']}) for $\mathcal{R}^\times$ depending on $\hat{\lambda}$. Gray markers represent the numerical values of the loss function, determined with Alg. \ref{['alg:lossNum']}. The green dashed line shows the upper bound (\ref{['eq:lossModUp']}) and the blue dashed line shows the lower bound (\ref{['eq:lossModLow']}).
Figure 3: Loss function (\ref{['eq:lossSimple']}) for $\mathcal{R}^*$ depending on $\hat{\lambda}$. Gray markers represent the numerical values of the loss function, determined with Alg. \ref{['alg:lossNum']}. The green dashed line shows the upper bound (\ref{['eq:lossUp']}) and the blue dashed line shows the lower bound (\ref{['eq:lLow']}).
Figure 4: Loss function (\ref{['eq:lossSimple']}) depending on $\hat{\lambda}$. Markers represent the numerical values of the loss function, determined with Alg. \ref{['alg:lossNum']}, black for the $\mathcal{R}^*$-RS and gray for the $R^\times$-RS. The green line in the middle and right plot shows the upper bound (\ref{['eq:lossUp']}). The gray dashed line shows the lower bound (\ref{['eq:lLow']}) of the $\mathcal{R}^*$-RS and the blue dashed line shows the lower bound (\ref{['eq:lossModLow']}) of the $\mathcal{R}^\times$-RS.
Figure 5: Left and middle plot: loss function (\ref{['eq:lossSimple']}) for $\mathcal{R}^\#$ depending on $\hat{\lambda}$. Gray markers represent the numerical values of the loss function, determined with Alg. \ref{['alg:lossNum']}. The green dashed line shows the upper bound (\ref{['eq:lossPowUp']}) and the blue dashed line shows the lower bound (\ref{['eq:lossPowLow']}). Right plot: visualization for Eqs. (\ref{['eq:intUp']}) and (\ref{['eq:intLow']}).
...and 4 more figures

Theorems & Definitions (28)

Theorem 1
proof
Theorem 2
proof
Theorem 3
proof
Theorem 4
proof
Theorem 5
proof
...and 18 more

Optimal Restart Strategies for Parameter-dependent Optimization Algorithms

TL;DR

Abstract

Optimal Restart Strategies for Parameter-dependent Optimization Algorithms

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (28)