NetGPT: A Native-AI Network Architecture Beyond Provisioning Personalized Generative Services
Yuxuan Chen, Rongpeng Li, Zhifeng Zhao, Chenghui Peng, Jianjun Wu, Ekram Hossain, Honggang Zhang
TL;DR
NetGPT addresses provisioning of personalized generative services by orchestrating cloud-edge LLM collaboration within an AI-native network architecture. It demonstrates feasibility by deploying GPT-2-base at the edge and LLaMA-7B in the cloud with LoRA-based fine-tuning, and formalizes edge-to-cloud prompt enhancement as $P_{\text{com}} = \text{LLM}_{\theta}(P_{\text{con}}; \mathcal{I}_{\text{personalized}})$, with $\theta^{*}$ defined by a minimization over a dataset. The study shows substantial latency and bandwidth advantages over cloud-only deployments, while keeping edge resource usage modest (e.g., approximately $1.65$ GB VRAM) and enabling location-based personalization via edge-generated comprehensive prompts. Beyond generative services, NetGPT proposes an AI-native network architecture with converged C&C, data/privacy protections, and a logical AI workflow to unify network management tasks such as popularity prediction and intent inference. These contributions suggest a practical path toward AI-integrated network control that leverages edge-local personalization and cloud-scale reasoning, while acknowledging challenges in data privacy, online adaptation, and multi-modal extensions.
Abstract
Large language models (LLMs) have triggered tremendous success to empower our daily life by generative information. The personalization of LLMs could further contribute to their applications due to better alignment with human intents. Towards personalized generative services, a collaborative cloud-edge methodology is promising, as it facilitates the effective orchestration of heterogeneous distributed communication and computing resources. In this article, we put forward NetGPT to capably synergize appropriate LLMs at the edge and the cloud based on their computing capacity. In addition, edge LLMs could efficiently leverage location-based information for personalized prompt completion, thus benefiting the interaction with the cloud LLM. In particular, we present the feasibility of NetGPT by leveraging low-rank adaptation-based fine-tuning of open-source LLMs (i.e., GPT-2-base model and LLaMA model), and conduct comprehensive numerical comparisons with alternative cloud-edge collaboration or cloud-only techniques, so as to demonstrate the superiority of NetGPT. Subsequently, we highlight the essential changes required for an artificial intelligence (AI)-native network architecture towards NetGPT, with emphasis on deeper integration of communications and computing resources and careful calibration of logical AI workflow. Furthermore, we demonstrate several benefits of NetGPT, which come as by-products, as the edge LLMs' capability to predict trends and infer intents promises a unified solution for intelligent network management & orchestration. We argue that NetGPT is a promising AI-native network architecture for provisioning beyond personalized generative services.
