Table of Contents
Fetching ...

SkipPredict: When to Invest in Predictions for Scheduling

Rana Shahout, Michael Mitzenmacher

TL;DR

This work studies how to incorporate prediction costs into queueing-based scheduling, introducing SkipPredict, a two-stage policy for an $M/G/1$ system with cheap one-bit predictions to separate short vs long jobs and expensive predictions to estimate remaining processing times for the long group. It analyzes two cost models—external (predictions cost time but do not affect service) and server-time (prediction processing consumes server resources)—and derives exact mean response-time expressions via the SOAP framework, comparing SkipPredict to FCFS, 1bit, SPRPT, and a DelayPredict baseline. The results show SkipPredict can outperform competing policies, especially when there is a sizable gap between cheap and expensive prediction costs, and when predictions for long jobs yield substantial gains under high load. The paper also explores generalizations to non-fixed costs and offers several SkipPredict variants and practical considerations for deployment, enabling cost-aware prediction usage in scheduling systems.

Abstract

In light of recent work on scheduling with predicted job sizes, we consider the effect of the cost of predictions in queueing systems, removing the assumption in prior research that predictions are external to the system's resources and/or cost-free. In particular, we introduce a novel approach to utilizing predictions, SkipPredict, designed to address their inherent cost. Rather than uniformly applying predictions to all jobs, we propose a tailored approach that categorizes jobs based on their prediction requirements. To achieve this, we employ one-bit "cheap predictions" to classify jobs as either short or long. SkipPredict prioritizes predicted short jobs over long jobs, and for the latter, SkipPredict applies a second round of more detailed "expensive predictions" to approximate Shortest Remaining Processing Time for these jobs. Our analysis takes into account the cost of prediction. We examine the effect of this cost for two distinct models. In the external cost model, predictions are generated by some external method without impacting job service times but incur a cost. In the server time cost model, predictions themselves require server processing time, and are scheduled on the same server as the jobs.

SkipPredict: When to Invest in Predictions for Scheduling

TL;DR

This work studies how to incorporate prediction costs into queueing-based scheduling, introducing SkipPredict, a two-stage policy for an system with cheap one-bit predictions to separate short vs long jobs and expensive predictions to estimate remaining processing times for the long group. It analyzes two cost models—external (predictions cost time but do not affect service) and server-time (prediction processing consumes server resources)—and derives exact mean response-time expressions via the SOAP framework, comparing SkipPredict to FCFS, 1bit, SPRPT, and a DelayPredict baseline. The results show SkipPredict can outperform competing policies, especially when there is a sizable gap between cheap and expensive prediction costs, and when predictions for long jobs yield substantial gains under high load. The paper also explores generalizations to non-fixed costs and offers several SkipPredict variants and practical considerations for deployment, enabling cost-aware prediction usage in scheduling systems.

Abstract

In light of recent work on scheduling with predicted job sizes, we consider the effect of the cost of predictions in queueing systems, removing the assumption in prior research that predictions are external to the system's resources and/or cost-free. In particular, we introduce a novel approach to utilizing predictions, SkipPredict, designed to address their inherent cost. Rather than uniformly applying predictions to all jobs, we propose a tailored approach that categorizes jobs based on their prediction requirements. To achieve this, we employ one-bit "cheap predictions" to classify jobs as either short or long. SkipPredict prioritizes predicted short jobs over long jobs, and for the latter, SkipPredict applies a second round of more detailed "expensive predictions" to approximate Shortest Remaining Processing Time for these jobs. Our analysis takes into account the cost of prediction. We examine the effect of this cost for two distinct models. In the external cost model, predictions are generated by some external method without impacting job service times but incur a cost. In the server time cost model, predictions themselves require server processing time, and are scheduled on the same server as the jobs.
Paper Structure (35 sections, 21 theorems, 121 equations, 7 figures, 1 table)

This paper contains 35 sections, 21 theorems, 121 equations, 7 figures, 1 table.

Key Result

Theorem 1

Under any SOAP policy, the mean response time of jobs with descriptor $d$ and size $x_J$ is:

Figures (7)

  • Figure 1: Considering prediction cost in SPRPT algorithm in M/M/1 system. The cost is the sum of mean response time and fixed prediction cost.
  • Figure 2: The SkipPredict algorithm.
  • Figure 3: Cost in the external cost and server cost models using uniform predictor, cheap predictor is configured with $\alpha = 0.8$, expensive predictor with $\alpha=0.2$, service times are exponentially distributed with mean 1. The default costs for the external model are $c_1= 0.5, c_2=2$ and for the server cost model are $c_1= 0.01, c_2=0.05$ (a + d) Cost vs. $c_2$ when $\lambda = 0.9$ and $T=1$ (b + e) Cost vs. $T$ when $\lambda = 0.9$ (c + f) Cost vs. $\lambda$ when $T=1$.
  • Figure 4: Cost in the external cost and server cost models using exponential predictor for Weibull service time. The default costs for the external model are $c_1= 0.5, c_2=2$ and for the server cost model are $c_1= 0.01, c_2=0.05$ (a + d) Cost vs. $c_2$ when $\lambda = 0.9$ and $T=1$ (b + e) Cost vs. $T$ when $\lambda = 0.9$ (c + f) Cost vs. $\lambda$ when $T=1$.
  • Figure 5: DelayPredict vs. SkipPredict and SPRPT in the external cost model (a) cost vs. arrival rate with $c_1=0.5, c_2=4$, $T=L=1$ (b) cost vs. arrival rate with $c_1=3.5, c_2=4$, $T=L=1$ (c) cost vs. threshold ($T$ for SkipPredict and $L$ for DelayPredict), $c_1=0.5, c_2=2$, $\lambda = 0.9$.
  • ...and 2 more figures

Theorems & Definitions (38)

  • Definition 1
  • Definition 2
  • Definition 3
  • Theorem 1: Theorem 5.5 of scully2018soap
  • Lemma 1
  • proof
  • Definition 4
  • Lemma 2
  • proof
  • Lemma 3
  • ...and 28 more