Dynamic SLA-aware Network Slice Monitoring

Niloy Saha; Mina Tahmasbi Arashloo; Nashid Shahriar; Raouf Boutaba

Dynamic SLA-aware Network Slice Monitoring

Niloy Saha, Mina Tahmasbi Arashloo, Nashid Shahriar, Raouf Boutaba

TL;DR

The paper tackles real-time SLA-aware monitoring for large-scale network slicing under telemetry budget constraints by formulating monitoring as a closed-loop control problem. It introduces the Telemetry Primitive Contract (TPC) and presents SliceScope, a practical system using change-triggered In-Band Network Telemetry (INT) to dynamically allocate monitoring resources across slices and SLA metrics. Through a joint multi-slice optimization, epoch-based adaptation, and per-slice tunable thresholds, SliceScope achieves up to 4x more accurate tracking for critical slices and outperforms alternative telemetry primitives in end-to-end SLA tracking. The evaluation spans large-scale simulations and a hardware testbed on Intel Tofino, validating the approach and its deployment considerations such as bounded header size and path-aware state management. Overall, the work demonstrates a viable, adaptive framework for SLA-aware telemetry in programmable networks and outlines directions for future enhancements and broader integration.

Abstract

Next-generation networks increasingly rely on network slices - logical networks tailored to specific application requirements, each with distinct Service-Level Agreements (SLAs). Ensuring compliance with these SLAs requires continuous, real-time monitoring of end-to-end performance metrics for each slice, within a limited telemetry budget. However, we find that existing solutions face two fundamental limitations: they either lack end-to-end visibility (e.g., sketches, probabilistic sampling) or provide visibility but lack the control mechanisms to dynamically allocate monitoring resources according to slice SLAs. We address this through a formal framework that reframes slice monitoring as a closed-loop control problem, and defines the minimal data plane requirements for SLA-aware slice monitoring via a telemetry primitive contract. We then present SliceScope, a realization of this framework that combines: (1) a control strategy that dynamically allocates the monitoring resources across diverse slices according to their SLA criticality, and (2) a data-plane based on change-triggered INT that provides per-packet end-to-end visibility with tunable accuracy-overhead trade-offs, satisfying the telemetry contract. Our evaluation results on programmable switches and in large-scale simulations with a mixture of different slice types, demonstrate that SliceScope tracks critical slices up to 4x more accurately compared to static baselines, while showing that change-triggered INT outperforms alternative primitives for realizing the telemetry primitive contract.

Dynamic SLA-aware Network Slice Monitoring

TL;DR

Abstract

Dynamic SLA-aware Network Slice Monitoring

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)