Distributionally Robust Wireless Semantic Communication with Large AI Models
Long Tan Le, Senura Hansaja Wanasekara, Zerun Niu, Nguyen H. Tran, Phuong Vo, Walid Saad, Dusit Niyato, Zhu Han, Choong Seon Hong, H. Vincent Poor
TL;DR
This work addresses robustness in semantic communication for wireless networks by introducing WaSeCom, a bilevel framework based on WDRO to hedge both semantic interpretation errors and channel distortions. It leverages dual WDRO formulations and log-sum-exp smoothing to enable scalable training of large AI backbones (e.g., ViT, BERT) for text and image modalities, with two independent Wasserstein ambiguity sets for semantic inputs and channel outputs. Theoretical results establish generalization guarantees and convergence to robust stationary points, while empirical results on CIFAR-10 and Europarl demonstrate improved robustness under semantic perturbations and wireless channel variability without sacrificing nominal performance. The approach offers a model-agnostic, end-to-end solution that enhances semantic fidelity under adverse wireless conditions, with practical guidelines for hyperparameter tuning and potential extensions to time-varying channels.
Abstract
Semantic communication (SemCom) has emerged as a promising paradigm for 6G wireless systems by transmitting task-relevant information rather than raw bits, yet existing approaches remain vulnerable to dual sources of uncertainty: semantic misinterpretation arising from imperfect feature extraction and transmission-level perturbations from channel noise. Current deep learning based SemCom systems typically employ domain-specific architectures that lack robustness guarantees and fail to generalize across diverse noise conditions, adversarial attacks, and out-of-distribution data. In this paper, a novel and generalized semantic communication framework called WaSeCom is proposed to systematically address uncertainty and enhance robustness. In particular, Wasserstein distributionally robust optimization is employed to provide resilience against semantic misinterpretation and channel perturbations. A rigorous theoretical analysis is performed to establish the robust generalization guarantees of the proposed framework. Experimental results on image and text transmission demonstrate that WaSeCom achieves improved robustness under noise and adversarial perturbations. These results highlight its effectiveness in preserving semantic fidelity across varying wireless conditions.
