Table of Contents
Fetching ...

Se-HiLo: Noise-Resilient Semantic Communication with High-and-Low Frequency Decomposition

Zhiyuan Xi, Kun Zhu, Yuanyuan Xu

TL;DR

Se-HiLo addresses unpredictable semantic noise in image transmission by replacing adversarial training with a Finite Scalar Quantization (FSQ) module and augmenting it with a transformer-based High-and-Low Frequency Decomposition (HiLo). FSQ enforces robust discrete representations, and HiLo decouples information into high- and low-frequency components mapped to separate FSQ spaces to preserve representational diversity. The approach shows improved noise resilience and reconstruction accuracy across diverse SNRs, outperforming baseline semantic communication methods and avoiding unstable training dynamics associated with adversarial strategies. This has practical implications for scalable, noise-robust semantic transmission in real-world communication systems where noise patterns are unpredictable.

Abstract

Semantic communication has emerged as a transformative paradigm in next-generation communication systems, leveraging advanced artificial intelligence (AI) models to extract and transmit semantic representations for efficient information exchange. Nevertheless, the presence of unpredictable semantic noise, such as ambiguity and distortions in transmitted representations, often undermines the reliability of received information. Conventional approaches primarily adopt adversarial training with noise injection to mitigate the adverse effects of noise. However, such methods exhibit limited adaptability to varying noise levels and impose additional computational overhead during model training. To address these challenges, this paper proposes Noise-Resilient \textbf{Se}mantic Communication with \textbf{Hi}gh-and-\textbf{Lo}w Frequency Decomposition (Se-HiLo) for image transmission. The proposed Se-HiLo incorporates a Finite Scalar Quantization (FSQ) based noise-resilient module, which bypasses adversarial training by enforcing encoded representations within predefined spaces to enhance noise resilience. While FSQ improves robustness, it compromise representational diversity. To alleviate this trade-off, we adopt a transformer-based high-and-low frequency decomposition module that decouples image representations into high-and-low frequency components, mapping them into separate FSQ representation spaces to preserve representational diversity. Extensive experiments demonstrate that Se-HiLo achieves superior noise resilience and ensures accurate semantic communication across diverse noise environments.

Se-HiLo: Noise-Resilient Semantic Communication with High-and-Low Frequency Decomposition

TL;DR

Se-HiLo addresses unpredictable semantic noise in image transmission by replacing adversarial training with a Finite Scalar Quantization (FSQ) module and augmenting it with a transformer-based High-and-Low Frequency Decomposition (HiLo). FSQ enforces robust discrete representations, and HiLo decouples information into high- and low-frequency components mapped to separate FSQ spaces to preserve representational diversity. The approach shows improved noise resilience and reconstruction accuracy across diverse SNRs, outperforming baseline semantic communication methods and avoiding unstable training dynamics associated with adversarial strategies. This has practical implications for scalable, noise-robust semantic transmission in real-world communication systems where noise patterns are unpredictable.

Abstract

Semantic communication has emerged as a transformative paradigm in next-generation communication systems, leveraging advanced artificial intelligence (AI) models to extract and transmit semantic representations for efficient information exchange. Nevertheless, the presence of unpredictable semantic noise, such as ambiguity and distortions in transmitted representations, often undermines the reliability of received information. Conventional approaches primarily adopt adversarial training with noise injection to mitigate the adverse effects of noise. However, such methods exhibit limited adaptability to varying noise levels and impose additional computational overhead during model training. To address these challenges, this paper proposes Noise-Resilient \textbf{Se}mantic Communication with \textbf{Hi}gh-and-\textbf{Lo}w Frequency Decomposition (Se-HiLo) for image transmission. The proposed Se-HiLo incorporates a Finite Scalar Quantization (FSQ) based noise-resilient module, which bypasses adversarial training by enforcing encoded representations within predefined spaces to enhance noise resilience. While FSQ improves robustness, it compromise representational diversity. To alleviate this trade-off, we adopt a transformer-based high-and-low frequency decomposition module that decouples image representations into high-and-low frequency components, mapping them into separate FSQ representation spaces to preserve representational diversity. Extensive experiments demonstrate that Se-HiLo achieves superior noise resilience and ensures accurate semantic communication across diverse noise environments.

Paper Structure

This paper contains 29 sections, 2 theorems, 17 equations, 11 figures.

Key Result

Theorem 1

For a single-dimension input $x_i$ quantized into $m_i$ levels over $[L, U]$ with step size $\Delta$ under additive white Gaussian noise (AWGN) $\eta_i \sim \mathcal{N}(0, \sigma^2)$, the probability of correct quantization is:

Figures (11)

  • Figure 1: The Impact of Semantic Noise
  • Figure 2: t-SNE Visualization: ResNet34 (Top) vs. ResNet34 with FSQ (Bottom) under Varying SNRs
  • Figure 3: Noise-Resilient Semantic Communication Framework with High-and-Low-Frequency Decomposition
  • Figure 4: Finite Scalar Quantization
  • Figure 5: High-and-Low-Frequency Decomposition Block
  • ...and 6 more figures

Theorems & Definitions (4)

  • Theorem 1
  • proof
  • Theorem 2
  • proof