Context-Aware Semantic Communication for the Wireless Networks
Guangyuan Liu, Yinqiu Liu, Jiacheng Wang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour
TL;DR
CaSemCom tackles the challenge of transmitting only task-relevant semantic content over dynamic wireless links, addressing inefficiencies of static SemCom pipelines. It introduces an LLM-based gating mechanism combined with a Mixture of Experts to dynamically select input content and activate modality-specific encoders, optimizing semantic fidelity under bandwidth and latency constraints. The paper demonstrates a multimodal, multi-user case study showing faster convergence, higher $SSIM$ and $ImageReward$, and reduced retransmission overhead under Rayleigh fading with $SNR$ in $[-13,30]$ dB. The work advances context-aware SemCom by unifying linguistic, wireless, and semantic cues and enables scalable, reusable semantic encoders for future wireless systems. The practical impact lies in supporting real-time AR/VR, autonomous driving, and Metaverse services with lower bandwidth while preserving meaning.
Abstract
In next-generation wireless networks, supporting real-time applications such as augmented reality, autonomous driving, and immersive Metaverse services demands stringent constraints on bandwidth, latency, and reliability. Existing semantic communication (SemCom) approaches typically rely on static models, overlooking dynamic conditions and contextual cues vital for efficient transmission. To address these challenges, we propose CaSemCom, a context-aware SemCom framework that leverages a Large Language Model (LLM)-based gating mechanism and a Mixture of Experts (MoE) architecture to adaptively select and encode only high-impact semantic features across multiple data modalities. Our multimodal, multi-user case study demonstrates that CaSemCom significantly improves reconstructed image fidelity while reducing bandwidth usage, outperforming single-agent deep reinforcement learning (DRL) methods and traditional baselines in convergence speed, semantic accuracy, and retransmission overhead.
