Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Bowen Sun; Christos D. Antonopoulos; Evgenia Smirni; Bin Ren; Nikolaos Bellas; Spyros Lalis

Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Bowen Sun, Christos D. Antonopoulos, Evgenia Smirni, Bin Ren, Nikolaos Bellas, Spyros Lalis

TL;DR

LACE-RL is presented, a latency-aware and carbon-efficient management framework that formulates serverless pod retention as a sequential decision problem, achieving better latency-carbon trade-offs than state-of-the-art heuristic and single-objective baselines, approaching Oracle performance.

Abstract

Serverless computing simplifies cloud deployment but introduces new challenges in managing service latency and carbon emissions. Reducing cold-start latency requires retaining warm function instances, while minimizing carbon emissions favors reclaiming idle resources. This balance is further complicated by time-varying grid carbon intensity and varying workload patterns, under which static keep-alive policies are inefficient. We present LACE-RL, a latency-aware and carbon-efficient management framework that formulates serverless pod retention as a sequential decision problem. LACE-RL uses deep reinforcement learning to dynamically tune keep-alive durations, jointly modeling cold-start probability, function-specific latency costs, and real-time carbon intensity. Using the Huawei Public Cloud Trace, we show that LACE-RL reduces cold starts by 51.69% and idle keep-alive carbon emissions by 77.08% compared to Huawei's static policy, while achieving better latency-carbon trade-offs than state-of-the-art heuristic and single-objective baselines, approaching Oracle performance.

Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

TL;DR

Abstract

Paper Structure (24 sections, 6 equations, 10 figures, 3 tables)

This paper contains 24 sections, 6 equations, 10 figures, 3 tables.

Introduction
Energy Modeling and Challenges
Huawei Cloud Dataset
Energy Model: Accounting for Serverless Carbon Footprint
Challenges in Serverless Management
System Design
LACE-RL Overview
Carbon-Latency Tradeoff via Keep-Alive Tuning
Reinforcement-Learning Model Formulation
Evaluation
Experimental Methodology
Model and Parameters Validation via Real-System Measurements
Dataset and Trace Processing
Energy and Carbon Modeling
RL Model Configuration
...and 9 more sections

Figures (10)

Figure 1: Characterization of the Huawei Cloud Trace joosen2025serverless.
Figure 2: Impact of keep-alive timeout for two different but representative functions in the Huawei dataset. In both cases, longer timeouts reduce cold starts but increase idle carbon footprint. Depending on the function, idle carbon may even surpass (significantly) the carbon for execution (right plot).
Figure 3: Observed variability in carbon intensity of the electrical grid electricitymaps(a) and Huawei's function memory footprint (b).
Figure 4: LACE-RL overview.
Figure 5: General testing workload results across key metrics. The dashed line shows the optimal value.
...and 5 more figures

Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

TL;DR

Abstract

Green or Fast? Learning to Balance Cold Starts and Idle Carbon in Serverless Computing

Authors

TL;DR

Abstract

Table of Contents

Figures (10)