SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

Cheryl Lee; Zhouruixing Zhu; Tianyi Yang; Yintong Huo; Yuxin Su; Pinjia He; Michael R. Lyu

SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

Cheryl Lee, Zhouruixing Zhu, Tianyi Yang, Yintong Huo, Yuxin Su, Pinjia He, Michael R. Lyu

TL;DR

The paper addresses the persistent cold-start latency in serverless Function-as-a-Service by proposing SPES, a differentiated scheduler that leverages invocation pattern analysis to predict and provision function instances. SPES categorizes functions into five deterministic types and uses adaptive strategies to handle concept drift, including a co-occurrence-based correlation metric to connect unknown/infrequently invoked functions with known ones. The approach yields substantial improvements in the 75th percentile cold-start rate (approximately 49.77% reduction) and wasted memory time (approximately 56.43% reduction) compared with baselines, while maintaining modest overhead. By enabling accurate next-invocation prediction without extensive training data, SPES offers a scalable, developer-free mechanism to optimize latency-resource trade-offs in real-world serverless deployments, with publicly available code for reproducibility and further study.

Abstract

As an emerging cloud computing deployment paradigm, serverless computing is gaining traction due to its efficiency and ability to harness on-demand cloud resources. However, a significant hurdle remains in the form of the cold start problem, causing latency when launching new function instances from scratch. Existing solutions tend to use over-simplistic strategies for function pre-loading/unloading without full invocation pattern exploitation, rendering unsatisfactory optimization of the trade-off between cold start latency and resource waste. To bridge this gap, we propose SPES, the first differentiated scheduler for runtime cold start mitigation by optimizing serverless function provision. Our insight is that the common architecture of serverless systems prompts the concentration of certain invocation patterns, leading to predictable invocation behaviors. This allows us to categorize functions and pre-load/unload proper function instances with finer-grained strategies based on accurate invocation prediction. Experiments demonstrate the success of SPES in optimizing serverless function provision on both sides: reducing the 75th-percentile cold start rates by 49.77% and the wasted memory time by 56.43%, compared to the state-of-the-art. By mitigating the cold start issue, SPES is a promising advancement in facilitating cloud services deployed on serverless architectures.

SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

TL;DR

Abstract

Paper Structure (50 sections, 15 figures, 1 table, 1 algorithm)

This paper contains 50 sections, 15 figures, 1 table, 1 algorithm.

INTRODUCTION
Background
Serverless Computing
Cold Start Challenge
Preliminary Empirical Analysis
Challenges of Serverless Function Provision
Efficiency requirement
Scalability under invocation spikes
Imbalance in invocation distribution
Evolution in invocation behavior
Observations and Our Insight
Invocation pattern and triggers
Application workflow
Temporal locality in invocations
METHODOLOGY
...and 35 more sections

Figures (15)

Figure 1: An weather inquiry website based on a serverless web application.
Figure 2: A serverless function's lifecycle.
Figure 3: The distribution of function invocations. The x-axis represents the range of function invocation counts, and the y-axis indicates the number of functions falling into the corresponding ranges.
Figure 4: The function invocations can experience distinct concept shifts that may degrade the provision performance. Different colors distinguish changes over time in the function invocation patterns.
Figure 5: The proportion of trigger types among functions.
...and 10 more figures

SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

TL;DR

Abstract

SPES: Towards Optimizing Performance-Resource Trade-Off for Serverless Functions

Authors

TL;DR

Abstract

Table of Contents

Figures (15)