Table of Contents
Fetching ...

Adaptive Semantic Communication for Wireless Image Transmission Leveraging Mixture-of-Experts Mechanism

Haowen Wan, Qianqian Yang

Abstract

Deep learning based semantic communication has achieved significant progress in wireless image transmission, but most existing schemes rely on fixed models and thus lack robustness to diverse image contents and dynamic channel conditions. To improve adaptability, recent studies have developed adaptive semantic communication strategies that adjust transmission or model behavior according to either source content or channel state. More recently, MoE-based semantic communication has emerged as a sparse and efficient adaptive architecture, although existing designs still mainly rely on single-driven routing. To address this limitation, we propose a novel multi-stage end-to-end image semantic communication system for multi-input multi-output (MIMO) channels, built upon an adaptive MoE Swin Transformer block. Specifically, we introduce a dynamic expert gating mechanism that jointly evaluates both real-time CSI and the semantic content of input image patches to compute adaptive routing probabilities. By selectively activating only a specialized subset of experts based on this joint condition, our approach breaks the rigid coupling of traditional adaptive methods and overcomes the bottlenecks of single-driven routing. Simulation results indicate a significant improvement in reconstruction quality over existing methods while maintaining the transmission efficiency.

Adaptive Semantic Communication for Wireless Image Transmission Leveraging Mixture-of-Experts Mechanism

Abstract

Deep learning based semantic communication has achieved significant progress in wireless image transmission, but most existing schemes rely on fixed models and thus lack robustness to diverse image contents and dynamic channel conditions. To improve adaptability, recent studies have developed adaptive semantic communication strategies that adjust transmission or model behavior according to either source content or channel state. More recently, MoE-based semantic communication has emerged as a sparse and efficient adaptive architecture, although existing designs still mainly rely on single-driven routing. To address this limitation, we propose a novel multi-stage end-to-end image semantic communication system for multi-input multi-output (MIMO) channels, built upon an adaptive MoE Swin Transformer block. Specifically, we introduce a dynamic expert gating mechanism that jointly evaluates both real-time CSI and the semantic content of input image patches to compute adaptive routing probabilities. By selectively activating only a specialized subset of experts based on this joint condition, our approach breaks the rigid coupling of traditional adaptive methods and overcomes the bottlenecks of single-driven routing. Simulation results indicate a significant improvement in reconstruction quality over existing methods while maintaining the transmission efficiency.

Paper Structure

This paper contains 9 sections, 8 equations, 5 figures.

Figures (5)

  • Figure 1: The overall structure of the proposed scheme for wireless image transmission.
  • Figure 2: (a) The specific architecture of two successive AD-MoE STBlock. (b) The architecture of AD-MoE MLP Block. $E^s$, $E^r$ denotes the number of shared experts and the number of routed experts in AD-MOE MLP blocks respectively.
  • Figure 3: (a)$\sim$(b) PSNR and LPIPS performance of different models versus SNR under MIMO fading channels for the Kodak dataset, with R of 0.0833; results are shown for $2\times2$ and $8\times8$ transmit-receive antenna configurations. (c)$\sim$(d) PSNR and LPIPS performance of different models versus R under MIMO fading channel of Kodak Dataset, with SNR of 10 dB; results are shown for $2\times2$ and $8\times8$ transmit-receive antenna configurations.
  • Figure 4: Examples of visual comparison under MIMO fading channel at SNR = 10dB, R = 0.0833, $N_t$ = 8, $N_r$ = 8. The first column, second column, and third to fifth column shows the original image, original patch, and reconstructions of different transmission schemes, respectively.
  • Figure 5: Expert activation frequency in the last AD-MoE STBlock of the final encoder layer, evaluated on the CLIC2021 dataset.