Building AI Service Repositories for On-Demand Service Orchestration in 6G AI-RAN
Yun Tang, Mengbang Zou, Udhaya Chandhar Srinivasan, Obumneme Umealor, Dennis Kevogo, Benjamin James Scott, Weisi Guo
TL;DR
The paper addresses the challenge of orchestrating AI services in 6G AI-RAN by proposing a comprehensive, open-source, LLM-assisted toolchain to automate packaging, deployment, and runtime profiling of AI services. It provides a taxonomy of orchestration attributes (Functionality, Resource, Latency, Flexibility, Trustworthiness, Billing) and demonstrates how the toolchain extracts metadata, generates deployment-ready code, and profiles runtime behavior. The Cranfield AI Service Repository serves as a proof-of-concept, showing significant reductions in manual coding and the need for infrastructure-aware profiling. The work enables more practical, scalable on-demand AI service orchestration in 6G environments and lays groundwork for future extensions including safety, security, privacy, and federated AI workflows.
Abstract
Efficient orchestration of AI services in 6G AI-RAN requires well-structured, ready-to-deploy AI service repositories combined with orchestration methods adaptive to diverse runtime contexts across radio access, edge, and cloud layers. Current literature lacks comprehensive frameworks for constructing such repositories and generally overlooks key practical orchestration factors. This paper systematically identifies and categorizes critical attributes influencing AI service orchestration in 6G networks and introduces an open-source, LLM-assisted toolchain that automates service packaging, deployment, and runtime profiling. We validate the proposed toolchain through the Cranfield AI Service repository case study, demonstrating significant automation benefits, reduced manual coding efforts, and the necessity of infrastructure-specific profiling, paving the way for more practical orchestration frameworks.
