Unveiling Overlooked Performance Variance in Serverless Computing

Jinfeng Wen; Zhenpeng Chen; Federica Sarro; Shangguang Wang

Unveiling Overlooked Performance Variance in Serverless Computing

Jinfeng Wen, Zhenpeng Chen, Federica Sarro, Shangguang Wang

TL;DR

This paper addresses the overlooked problem of performance variance in serverless computing. By combining a literature survey of 99 papers with an empirical measurement of 72 serverless functions across multiple runs, it reveals large end-to-end latency variance (up to 338.76% with an average around 44%), and shows that most studies use too few repetitions, jeopardizing reproducibility. The work provides a detailed characterization of variance (via CV and boxplots), demonstrates that warm starts often exhibit greater variability, and highlights that reliable performance requires substantially more repetitions than commonly used in the literature. The findings carry practical implications for researchers, developers, and cloud providers, urging variance-aware benchmarking and offering a replication package to foster reproducible, reliable serverless performance evaluation.

Abstract

Serverless computing is an emerging cloud computing paradigm for developing applications at the function level, known as serverless functions. Due to the highly dynamic execution environment, multiple identical runs of the same serverless function can yield different performance, specifically in terms of end-to-end response latency. However, surprisingly, our analysis of serverless computing-related papers published in top-tier conferences highlights that the research community lacks awareness of the performance variance problem, with only 38.38% of these papers employing multiple runs for quantifying it. To further investigate, we analyze the performance of 72 serverless functions collected from these papers. Our findings reveal that the performance of these serverless functions can differ by up to 338.76% (44.28% on average) across different runs. Moreover, 61.11% of these functions produce unreliable performance results, with a low number of repetitions commonly employed in the serverless computing literature. Our study highlights a lack of awareness in the serverless computing community regarding the well-known performance variance problem in software engineering. The empirical results illustrate the substantial magnitude of this variance, emphasizing that ignoring the variance can affect research reproducibility and result reliability.

Unveiling Overlooked Performance Variance in Serverless Computing

TL;DR

Abstract

Paper Structure (16 sections, 13 figures, 4 tables)

This paper contains 16 sections, 13 figures, 4 tables.

Introduction
Background
Serverless computing
Performance of serverless functions
Methodology
Research questions
Collection of relevant papers
Collection and analysis of serverless functions
RQ1: Current Literature
RQ2: Variance Measurement
RQ3: Reliability and Repetitions
Implications
Threats to validity
Related Work
Conclusion
...and 1 more sections

Figures (13)

Figure 1: The process of developing, deploying, and executing serverless functions by software developers.
Figure 2: An overview of our methodology.
Figure 3: Number of serverless computing-related papers in top-tier conferences per year.
Figure 4: Distribution of 110 serverless computing-related papers per computer science area.
Figure 5: (RQ1) Distribution of the number of repetitions used in the research papers that report experiment repetitions.
...and 8 more figures

Unveiling Overlooked Performance Variance in Serverless Computing

TL;DR

Abstract

Unveiling Overlooked Performance Variance in Serverless Computing

Authors

TL;DR

Abstract

Table of Contents

Figures (13)