Table of Contents
Fetching ...

Learning-Augmented Algorithms with Explicit Predictors

Marek Elias, Haim Kaplan, Yishay Mansour, Shay Moran

TL;DR

This work unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand.

Abstract

Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data. These approaches have demonstrated an enhancement in performance when the predictions are accurate, while also ensuring robustness by providing worst-case guarantees when predictions fail. In this paper we focus on online problems; prior research in this context was focused on a paradigm where the predictor is pre-trained on past data and then used as a black box (to get the predictions it was trained for). In contrast, in this work, we unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge. In particular we allow the predictor to learn as it receives larger parts of the input, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand. Adopting this perspective, we focus on a number of fundamental problems, including caching and scheduling, which have been well-studied in the black-box setting. For each of the problems we consider, we introduce new algorithms that take advantage of explicit learning algorithms which we carefully design towards optimizing the overall performance. We demonstrate the potential of our approach by deriving performance bounds which improve over those established in previous work.

Learning-Augmented Algorithms with Explicit Predictors

TL;DR

This work unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand.

Abstract

Recent advances in algorithmic design show how to utilize predictions obtained by machine learning models from past and present data. These approaches have demonstrated an enhancement in performance when the predictions are accurate, while also ensuring robustness by providing worst-case guarantees when predictions fail. In this paper we focus on online problems; prior research in this context was focused on a paradigm where the predictor is pre-trained on past data and then used as a black box (to get the predictions it was trained for). In contrast, in this work, we unpack the predictor and integrate the learning problem it gives rise for within the algorithmic challenge. In particular we allow the predictor to learn as it receives larger parts of the input, with the ultimate goal of designing online learning algorithms specifically tailored for the algorithmic task at hand. Adopting this perspective, we focus on a number of fundamental problems, including caching and scheduling, which have been well-studied in the black-box setting. For each of the problems we consider, we introduce new algorithms that take advantage of explicit learning algorithms which we carefully design towards optimizing the overall performance. We demonstrate the potential of our approach by deriving performance bounds which improve over those established in previous work.
Paper Structure (41 sections, 39 theorems, 32 equations, 1 figure)

This paper contains 41 sections, 39 theorems, 32 equations, 1 figure.

Key Result

Theorem 1

Let $\mathcal{H}$ be a hypothesis class of size $\ell$ and $I$ be an input instance with offline optimum value $\mathop{\mathrm{OPT}}\nolimits(I)$. There is a deterministic algorithm for the realizable setting (i.e., $I\in \mathcal{H}$) which has cost at most $\mathop{\mathrm{OPT}}\nolimits(I) + k\l

Figures (1)

  • Figure 1: Summary of our results. Notation: $\ell = |\mathcal{H}|$; $k$ and $T$: cache size and instance length respectively in caching; $m$: the number of machines in load balancing; $n$: the number of jobs in non-clairvoyant scheduling; $\mu^*$: distance of the input from the hypothesis class in caching and non-clairvoyant scheduling; $\mathop{\mathrm{ALG}}\nolimits^*$: cost of the best algorithmic strategy suggested by $\mathcal{H}$.

Theorems & Definitions (66)

  • Theorem 1: Caching
  • Theorem 2: Load balancing
  • Theorem 3: Completion Time
  • Lemma 5
  • Lemma 6
  • proof
  • Theorem 7
  • Proposition 8: LenstraST90
  • Lemma 9
  • proof
  • ...and 56 more