Large Model Enabled Embodied Intelligence for 6G Integrated Perception, Communication, and Computation Network
Zhuoran Li, Zhen Gao, Xinhua Liu, Zheng Wang, Xiaotian Zhou, Lei Liu, Yongpeng Wu, Wei Feng, Yongming Huang
TL;DR
This work proposes large-model-enabled embodied intelligent base station agents (IBSAs) for 6G, embedding perception, cognition, and action at the base station level to create a closed-loop perception-communication-computation system. It introduces a three-layer IBSA architecture supported by cloud-edge-end collaboration and digital twins, demonstrated in two safety-critical scenarios: cooperative autonomous driving and low-altitude UAV safety. The paper outlines enabling technologies (cognitive cores, edge deployment, privacy/security, and digital twins) and a holistic evaluation framework with the IBSA-Score to quantify performance across perception, network, and agent dimensions. It also discusses generalization to Industrial IoT and Smart City contexts, along with challenges in benchmarks, continual adaptation, standardization, and trustworthy AI for practical deployment.
Abstract
The advent of sixth-generation (6G) places intelligence at the core of wireless architecture, fusing perception, communication, and computation into a single closed-loop. This paper argues that large artificial intelligence models (LAMs) can endow base stations with perception, reasoning, and acting capabilities, thus transforming them into intelligent base station agents (IBSAs). We first review the historical evolution of BSs from single-functional analog infrastructure to distributed, software-defined, and finally LAM-empowered IBSA, highlighting the accompanying changes in architecture, hardware platforms, and deployment. We then present an IBSA architecture that couples a perception-cognition-execution pipeline with cloud-edge-end collaboration and parameter-efficient adaptation. Subsequently,we study two representative scenarios: (i) cooperative vehicle-road perception for autonomous driving, and (ii) ubiquitous base station support for low-altitude uncrewed aerial vehicle safety monitoring and response against unauthorized drones. On this basis, we analyze key enabling technologies spanning LAM design and training, efficient edge-cloud inference, multi-modal perception and actuation, as well as trustworthy security and governance. We further propose a holistic evaluation framework and benchmark considerations that jointly cover communication performance, perception accuracy, decision-making reliability, safety, and energy efficiency. Finally, we distill open challenges on benchmarks, continual adaptation, trustworthy decision-making, and standardization. Together, this work positions LAM-enabled IBSAs as a practical path toward integrated perception, communication, and computation native, safety-critical 6G systems.
