Table of Contents
Fetching ...

Key Considerations for Auto-Scaling: Lessons from Benchmark Microservices

Majid Dashtbani, Ladan Tahvildari

TL;DR

Auto-scaling in microservices is hindered by design and deployment gaps not captured by standard benchmarks. The authors empirically evaluate six autoscaling methods on the Sock-Shop benchmark, organizing challenges into Architecture, Implementation, and Deployment phases to diagnose practical pitfalls. They demonstrate that lifecycle-aware engineering, including topology-aware scaling and improved observability, yields more stable and efficient scaling, with PBScaler outperforming threshold-based approaches when design gaps are addressed. The work provides actionable guidelines and a lifecycle-oriented checklist to enable robust auto-scaling in production microservice systems.

Abstract

Microservices have become the dominant architectural paradigm for building scalable and modular cloud-native systems. However, achieving effective auto-scaling in such systems remains a non-trivial challenge, as it depends not only on advanced scaling techniques but also on sound design, implementation, and deployment practices. Yet, these foundational aspects are often overlooked in existing benchmarks, making it difficult to evaluate autoscaling methods under realistic conditions. In this paper, we identify a set of practical auto-scaling considerations by applying several state-of-the-art autoscaling methods to widely used microservice benchmarks. To structure these findings, we classify the issues based on when they arise during the software lifecycle: Architecture, Implementation, and Deployment. The Architecture phase covers high-level decisions such as service decomposition and inter-service dependencies. The Implementation phase includes aspects like initialization overhead, metrics instrumentation, and error propagation. The Deployment phase focuses on runtime configurations such as resource limits and health checks. We validate these considerations using the Sock-Shop benchmark and evaluate diverse auto-scaling strategies, including threshold-based, control-theoretic, learning-based, black-box optimization, and dependency-aware approaches. Our findings show that overlooking key lifecycle concerns can degrade autoscaler performance, while addressing them leads to more stable and efficient scaling. These results underscore the importance of lifecycle-aware engineering for unlocking the full potential of auto-scaling in microservice-based systems.

Key Considerations for Auto-Scaling: Lessons from Benchmark Microservices

TL;DR

Auto-scaling in microservices is hindered by design and deployment gaps not captured by standard benchmarks. The authors empirically evaluate six autoscaling methods on the Sock-Shop benchmark, organizing challenges into Architecture, Implementation, and Deployment phases to diagnose practical pitfalls. They demonstrate that lifecycle-aware engineering, including topology-aware scaling and improved observability, yields more stable and efficient scaling, with PBScaler outperforming threshold-based approaches when design gaps are addressed. The work provides actionable guidelines and a lifecycle-oriented checklist to enable robust auto-scaling in production microservice systems.

Abstract

Microservices have become the dominant architectural paradigm for building scalable and modular cloud-native systems. However, achieving effective auto-scaling in such systems remains a non-trivial challenge, as it depends not only on advanced scaling techniques but also on sound design, implementation, and deployment practices. Yet, these foundational aspects are often overlooked in existing benchmarks, making it difficult to evaluate autoscaling methods under realistic conditions. In this paper, we identify a set of practical auto-scaling considerations by applying several state-of-the-art autoscaling methods to widely used microservice benchmarks. To structure these findings, we classify the issues based on when they arise during the software lifecycle: Architecture, Implementation, and Deployment. The Architecture phase covers high-level decisions such as service decomposition and inter-service dependencies. The Implementation phase includes aspects like initialization overhead, metrics instrumentation, and error propagation. The Deployment phase focuses on runtime configurations such as resource limits and health checks. We validate these considerations using the Sock-Shop benchmark and evaluate diverse auto-scaling strategies, including threshold-based, control-theoretic, learning-based, black-box optimization, and dependency-aware approaches. Our findings show that overlooking key lifecycle concerns can degrade autoscaler performance, while addressing them leads to more stable and efficient scaling. These results underscore the importance of lifecycle-aware engineering for unlocking the full potential of auto-scaling in microservice-based systems.

Paper Structure

This paper contains 12 sections, 10 figures, 5 tables.

Figures (10)

  • Figure 1: Impact of heavy services on auto-scaling performance. (a) and (b) demonstrate how service initialization overhead triggers false-positive scale-outs under KHPA - Max pods: 20; Resource configuration: Requests: 100m, Limits: 300m.
  • Figure 2: CPU contention - parallel startup of multiple replicas.
  • Figure 3: Front-end code masks Carts failure with HTTP 200.
  • Figure 4: Error masking lowers response time, misleading the autoscaler.
  • Figure 5: “/login” chain: (A) Business logic view; (B) Runtime invocation view.
  • ...and 5 more figures