Towards Edge General Intelligence: Knowledge Distillation for Mobile Agentic AI
Yuxuan Wu, Linghan Ma, Ruichen Zhang, Yinqiu Liu, Dusit Niyato, Shunpu Tang, Zehui Xiong, Zhu Han, Zhaohui Yang, Kaibin Huang, Zhaoyang Zhang, Kai-Kit Wong
TL;DR
This paper defines Edge General Intelligence (EGI) as cloud-like general cognition at the wireless edge and identifies the deployment chasm imposed by resource constraints. It advocates Knowledge Distillation (KD) as a central strategy to transfer the reasoning and control capabilities of large teachers to compact edge models, enabling on-device perception, planning, action, and memory. The authors systematically categorize KD techniques (response-, feature-, and relation-based) and map them to agentic AI components, while discussing architectures like Mamba and RWKV and cross-architecture distillation to close the performance gap. They survey KD-enabled wireless tasks (e.g., channel estimation, CSI feedback, modulation classification) and domain applications (UAVs, autonomous vehicles, robotics, IoT), and highlight challenges around benchmarking, robustness, and ethics with future directions toward safety-aware, modality-agnostic, collaborative KD at the edge. The work provides a comprehensive reference for advancing KD-driven mobile agentic AI toward practical, edge-resident general intelligence.
Abstract
Edge General Intelligence (EGI) represents a paradigm shift in mobile edge computing, where intelligent agents operate autonomously in dynamic, resource-constrained environments. However, the deployment of advanced agentic AI models on mobile and edge devices faces significant challenges due to limited computation, energy, and storage resources. To address these constraints, this survey investigates the integration of Knowledge Distillation (KD) into EGI, positioning KD as a key enabler for efficient, communication-aware, and scalable intelligence at the wireless edge. In particular, we emphasize KD techniques specifically designed for wireless communication and mobile networking, such as channel-aware self-distillation, cross-model Channel State Information (CSI) feedback distillation, and robust modulation/classification distillation. Furthermore, we review novel architectures natively suited for KD and edge deployment, such as Mamba, RWKV (Receptance, Weight, Key, Value) and Cross-Architecture distillation, which enhance generalization capabilities. Subsequently, we examine diverse applications in which KD-driven architectures enable EGI across vision, speech, and multimodal tasks. Finally, we highlight the key challenges and future directions for KD in EGI. This survey aims to provide a comprehensive reference for researchers exploring KD-driven frameworks for mobile agentic AI in the era of EGI.
