Table of Contents
Fetching ...

How long can you sleep? Idle Time System Inefficiencies and Opportunities

Georgia Antoniou, Haris Volos, Jawad Haj Yahya, Yiannakis Sazeides

TL;DR

The paper addresses wasted idle power in modern latency-sensitive servers by quantifying idle opportunity through a model-based framework. It applies queueing-theory models $M/M/1$, $cxM/M/1$, and $M/M/c$ to estimate idle-time distributions at core and package levels, and validates these predictions against measurements from a real 2-socket server with synthetic workloads. Key contributions include quantifying unrealized idle opportunities, identifying sources of inefficiency such as idle-governor inaccuracy and high transition overheads, and offering design guidance for hardware/software optimizations and early-stage exploration of configurations (e.g., core counts per package). The work enables more energy-efficient server design by pinpointing feasible deep-idle states and informing future power-management strategies, with extensibility to other system characteristics like interrupt overheads.

Abstract

This work introduces a model-based framework that reveals the idle opportunity of modern servers running latency-critical applications. Specifically, three queuing models, M/M/1, cxM/M/1, and M/M/c, are used to estimate the theoretical idle time distribution at the CPU core and system (package) level. A comparison of the actual idleness of a real server and that from the theoretical models reveals significant missed opportunities to enter deep idle states. This inefficiency is attributed to the idle-governor inaccuracy and the high latency to transition to/from legacy deep-idle states. The proposed methodology offers the means for an early-stage design exploration and insights into idle time behavior and opportunities for varying server system configurations and load.

How long can you sleep? Idle Time System Inefficiencies and Opportunities

TL;DR

The paper addresses wasted idle power in modern latency-sensitive servers by quantifying idle opportunity through a model-based framework. It applies queueing-theory models , , and to estimate idle-time distributions at core and package levels, and validates these predictions against measurements from a real 2-socket server with synthetic workloads. Key contributions include quantifying unrealized idle opportunities, identifying sources of inefficiency such as idle-governor inaccuracy and high transition overheads, and offering design guidance for hardware/software optimizations and early-stage exploration of configurations (e.g., core counts per package). The work enables more energy-efficient server design by pinpointing feasible deep-idle states and informing future power-management strategies, with extensibility to other system characteristics like interrupt overheads.

Abstract

This work introduces a model-based framework that reveals the idle opportunity of modern servers running latency-critical applications. Specifically, three queuing models, M/M/1, cxM/M/1, and M/M/c, are used to estimate the theoretical idle time distribution at the CPU core and system (package) level. A comparison of the actual idleness of a real server and that from the theoretical models reveals significant missed opportunities to enter deep idle states. This inefficiency is attributed to the idle-governor inaccuracy and the high latency to transition to/from legacy deep-idle states. The proposed methodology offers the means for an early-stage design exploration and insights into idle time behavior and opportunities for varying server system configurations and load.

Paper Structure

This paper contains 8 sections, 3 figures.

Figures (3)

  • Figure 1: High-level description of model that estimates full idle time distribution for an M/M/1, c$x$M/M/1 and M/M/c queue model.
  • Figure 2: Core C-state (M/M/1) residency for an ideal and legacy system for mean service times: 100us, 500us, 1ms, 5ms, 10ms, at 20% utilization.
  • Figure 3: Pkg C-state (cxM/M/1) residency for an ideal and legacy system for 100us and 10ms mean service times at 20% utilization.