Table of Contents
Fetching ...

Competitive strategies to use "warm start" algorithms with predictions

Vaidehi Srinivas, Avrim Blum

TL;DR

This work studies how predictions can accelerate warm-start algorithms when solving sequences of related problems. It develops three complementary avenues: (i) offline guarantees against a set of $k$ fixed predictions, achieving a constant-factor approximation to the best $k$-point predictor; (ii) leveraging coarse information via learnable $k$-wise partitions to remove the linear $k$-factor in runtime; and (iii) online guarantees against moving trajectories, presenting a deterministic algorithm that is $O(k^4\ln^2 k)$-competitive with any set of $k$ trajectories while keeping total runtime within a constant factor of the work done by the subroutine. The paper also connects online ball search to the $k$-server problem, showing reductions that yield $O(k^2)$-type guarantees in some regimes, and provides a unified framework for learning and deploying multiple predictions in warm-start contexts. Collectively, these results establish that warm starts with multiple, learnable predictions can yield substantial improvements over single-prediction baselines in both offline and online settings, with broad potential for practical speedups in related optimization and search tasks.

Abstract

We consider the problem of learning and using predictions for warm start algorithms with predictions. In this setting, an algorithm is given an instance of a problem, and a prediction of the solution. The runtime of the algorithm is bounded by the distance from the predicted solution to the true solution of the instance. Previous work has shown that when instances are drawn iid from some distribution, it is possible to learn an approximately optimal fixed prediction (Dinitz et al, NeurIPS 2021), and in the adversarial online case, it is possible to compete with the best fixed prediction in hindsight (Khodak et al, NeurIPS 2022). In this work we give competitive guarantees against stronger benchmarks that consider a set of $k$ predictions $\mathbf{P}$. That is, the "optimal offline cost" to solve an instance with respect to $\mathbf{P}$ is the distance from the true solution to the closest member of $\mathbf{P}$. This is analogous to the $k$-medians objective function. In the distributional setting, we show a simple strategy that incurs cost that is at most an $O(k)$ factor worse than the optimal offline cost. We then show a way to leverage learnable coarse information, in the form of partitions of the instance space into groups of "similar" instances, that allows us to potentially avoid this $O(k)$ factor. Finally, we consider an online version of the problem, where we compete against offline strategies that are allowed to maintain a moving set of $k$ predictions or "trajectories," and are charged for how much the predictions move. We give an algorithm that does at most $O(k^4 \ln^2 k)$ times as much work as any offline strategy of $k$ trajectories. This algorithm is deterministic (robust to an adaptive adversary), and oblivious to the setting of $k$. Thus the guarantee holds for all $k$ simultaneously.

Competitive strategies to use "warm start" algorithms with predictions

TL;DR

This work studies how predictions can accelerate warm-start algorithms when solving sequences of related problems. It develops three complementary avenues: (i) offline guarantees against a set of fixed predictions, achieving a constant-factor approximation to the best -point predictor; (ii) leveraging coarse information via learnable -wise partitions to remove the linear -factor in runtime; and (iii) online guarantees against moving trajectories, presenting a deterministic algorithm that is -competitive with any set of trajectories while keeping total runtime within a constant factor of the work done by the subroutine. The paper also connects online ball search to the -server problem, showing reductions that yield -type guarantees in some regimes, and provides a unified framework for learning and deploying multiple predictions in warm-start contexts. Collectively, these results establish that warm starts with multiple, learnable predictions can yield substantial improvements over single-prediction baselines in both offline and online settings, with broad potential for practical speedups in related optimization and search tasks.

Abstract

We consider the problem of learning and using predictions for warm start algorithms with predictions. In this setting, an algorithm is given an instance of a problem, and a prediction of the solution. The runtime of the algorithm is bounded by the distance from the predicted solution to the true solution of the instance. Previous work has shown that when instances are drawn iid from some distribution, it is possible to learn an approximately optimal fixed prediction (Dinitz et al, NeurIPS 2021), and in the adversarial online case, it is possible to compete with the best fixed prediction in hindsight (Khodak et al, NeurIPS 2022). In this work we give competitive guarantees against stronger benchmarks that consider a set of predictions . That is, the "optimal offline cost" to solve an instance with respect to is the distance from the true solution to the closest member of . This is analogous to the -medians objective function. In the distributional setting, we show a simple strategy that incurs cost that is at most an factor worse than the optimal offline cost. We then show a way to leverage learnable coarse information, in the form of partitions of the instance space into groups of "similar" instances, that allows us to potentially avoid this factor. Finally, we consider an online version of the problem, where we compete against offline strategies that are allowed to maintain a moving set of predictions or "trajectories," and are charged for how much the predictions move. We give an algorithm that does at most times as much work as any offline strategy of trajectories. This algorithm is deterministic (robust to an adaptive adversary), and oblivious to the setting of . Thus the guarantee holds for all simultaneously.
Paper Structure (22 sections, 19 theorems, 83 equations, 3 algorithms)

This paper contains 22 sections, 19 theorems, 83 equations, 3 algorithms.

Key Result

Lemma 3.1

Given an algorithm-with-predictions $\mathcal{A}$ for instances in $\mathbf{I}$, and a set of $k$ predictions $\mathbf{P} = (P_1, \dots, P_k), P_i \in \mathbf{S}$, for an instance $I$ with (unknown) true solution $S$, we can solve $I$ in time where $\mathbf{P}(S) = \mathop{\mathrm{argmin}}\limits_{P_j \in \mathbf{P}} \mathbf{d}(S, P_j)$.

Theorems & Definitions (53)

  • Definition 2.1: Warm start algorithm
  • Lemma 3.1: Using $k$ predictions
  • proof
  • Definition 3.3: Clustering cost of a set
  • Definition 3.4: Clustering cost of a distribution
  • Lemma 3.4: Learning (k) fixed points
  • proof
  • Theorem 3.5: Competing against $k$ fixed points offline
  • proof
  • Remark 3.6
  • ...and 43 more