Table of Contents
Fetching ...

Deploying AI-Based Applications with Serverless Computing in 6G Networks: An Experimental Study

Marc Michalke, Chukwuemeka Muonagor, Admela Jukan

TL;DR

The paper addresses how to deploy AI/ML workloads in 6G networks under latency and resource constraints by mapping ML tasks to a serverless edge architecture. It implements and evaluates a concrete stack using Benchfaster, k3s, and Knative to run distributed training on edge nodes and compares it to a cloud-based deployment. The results show that the 6G serverless approach maintains median model accuracy while reducing median response times by about 25%, though with increased variability in performance. This work demonstrates the viability of edge-focused serverless ML in 6G, highlighting trade-offs and outlining directions for further modeling and architecture development to enhance resource utilization and responsiveness.

Abstract

Future 6G networks are expected to heavily utilize machine learning capabilities in a wide variety of applications with features and benefits for both, the end user and the provider. While the options for utilizing these technologies are almost endless, from the perspective of network architecture and standardized service, the deployment decisions on where to execute the AI-tasks are critical, especially when considering the dynamic and heterogeneous nature of processing and connectivity capability of 6G networks. On the other hand, conceptual and standardization work is still in its infancy, as to how to categorizes ML applications in 6G landscapes; some of them are part of network management functions, some target the inference itself, while many others emphasize model training. It is likely that future mobile services may all be in the AI domain, or combined with AI. This work makes a case for the serverless computing paradigm to be used to this end. We first provide an overview of different machine learning applications that are expected to be relevant in 6G networks. We then create a set of general requirements for software engineering solutions executing these workloads from them and propose and implement a high-level edge-focused architecture to execute such tasks. We then map the ML-serverless paradigm to the case study of 6G architecture and test the resulting performance experimentally for a machine learning application against a setup created in a more traditional, cloud-based manner. Our results show that, while there is a trade-off in predictability of the response times and the accuracy, the achieved median accuracy in a 6G setup remains the same, while the median response time decreases by around 25% compared to the cloud setup.

Deploying AI-Based Applications with Serverless Computing in 6G Networks: An Experimental Study

TL;DR

The paper addresses how to deploy AI/ML workloads in 6G networks under latency and resource constraints by mapping ML tasks to a serverless edge architecture. It implements and evaluates a concrete stack using Benchfaster, k3s, and Knative to run distributed training on edge nodes and compares it to a cloud-based deployment. The results show that the 6G serverless approach maintains median model accuracy while reducing median response times by about 25%, though with increased variability in performance. This work demonstrates the viability of edge-focused serverless ML in 6G, highlighting trade-offs and outlining directions for further modeling and architecture development to enhance resource utilization and responsiveness.

Abstract

Future 6G networks are expected to heavily utilize machine learning capabilities in a wide variety of applications with features and benefits for both, the end user and the provider. While the options for utilizing these technologies are almost endless, from the perspective of network architecture and standardized service, the deployment decisions on where to execute the AI-tasks are critical, especially when considering the dynamic and heterogeneous nature of processing and connectivity capability of 6G networks. On the other hand, conceptual and standardization work is still in its infancy, as to how to categorizes ML applications in 6G landscapes; some of them are part of network management functions, some target the inference itself, while many others emphasize model training. It is likely that future mobile services may all be in the AI domain, or combined with AI. This work makes a case for the serverless computing paradigm to be used to this end. We first provide an overview of different machine learning applications that are expected to be relevant in 6G networks. We then create a set of general requirements for software engineering solutions executing these workloads from them and propose and implement a high-level edge-focused architecture to execute such tasks. We then map the ML-serverless paradigm to the case study of 6G architecture and test the resulting performance experimentally for a machine learning application against a setup created in a more traditional, cloud-based manner. Our results show that, while there is a trade-off in predictability of the response times and the accuracy, the achieved median accuracy in a 6G setup remains the same, while the median response time decreases by around 25% compared to the cloud setup.
Paper Structure (23 sections, 5 figures, 1 table)

This paper contains 23 sections, 5 figures, 1 table.

Figures (5)

  • Figure 1: Example Architecture
  • Figure 2: A 6G architecture as presented by 5G-PPPbahare_6g_2023
  • Figure 3: Mapping to 6G networks
  • Figure 4: ML Training Approaches
  • Figure 5: Experiment results