Table of Contents
Fetching ...

Distributed Nonlinear Transform Source-Channel Coding for Wireless Correlated Image Transmission

Yufei Bo, Meixia Tao

TL;DR

This work introduces Distributed Nonlinear Transform Source-Channel Coding (D-NTSCC) for correlated image transmission over wireless channels. It explicitly models source correlation with a joint entropy model and uses nonlinear transform coding to map images to latent representations, which are transmitted through JSCC channels. A variational loss combines distortion and rate terms, and a latent-alignment transformation at the receiver enhances cross-image correlation exploitation. Experiments on multi-view datasets demonstrate that D-NTSCC achieves superior rate-distortion performance and bandwidth efficiency compared to state-of-the-art distributed schemes and a point-to-point baseline, highlighting its potential for distributed semantic communications.

Abstract

This paper investigates distributed joint source-channel coding (JSCC) for correlated image semantic transmission over wireless channels. In this setup, correlated images at different transmitters are separately encoded and transmitted through dedicated channels for joint recovery at the receiver. We propose a novel distributed nonlinear transform source-channel coding (D-NTSCC) framework. Unlike existing learning-based approaches that implicitly learn source correlation in a purely data-driven manner, our method explicitly models the source correlation through joint distribution. Specifically, the correlated images are separately encoded into latent representations via an encoding transform function, followed by a JSCC encoder to produce channel input symbols. A learned joint entropy model is introduced to determine the transmission rates, which more accurately approximates the joint distribution of the latent representations and captures source dependencies, thereby improving rate-distortion performance. At the receiver, a JSCC decoder and a decoding transform function reconstruct the images from the received signals, each serving as side information for recovering the other image. Therein, a transformation module is designed to align the latent representations for maximal correlation learning. Furthermore, a loss function is derived to jointly optimize encoding, decoding, and the joint entropy model, ensuring that the learned joint entropy model approximates the true joint distribution. Experiments on multi-view datasets show that D-NTSCC outperforms state-of-the-art distributed schemes, demonstrating its effectiveness in exploiting source correlation.

Distributed Nonlinear Transform Source-Channel Coding for Wireless Correlated Image Transmission

TL;DR

This work introduces Distributed Nonlinear Transform Source-Channel Coding (D-NTSCC) for correlated image transmission over wireless channels. It explicitly models source correlation with a joint entropy model and uses nonlinear transform coding to map images to latent representations, which are transmitted through JSCC channels. A variational loss combines distortion and rate terms, and a latent-alignment transformation at the receiver enhances cross-image correlation exploitation. Experiments on multi-view datasets demonstrate that D-NTSCC achieves superior rate-distortion performance and bandwidth efficiency compared to state-of-the-art distributed schemes and a point-to-point baseline, highlighting its potential for distributed semantic communications.

Abstract

This paper investigates distributed joint source-channel coding (JSCC) for correlated image semantic transmission over wireless channels. In this setup, correlated images at different transmitters are separately encoded and transmitted through dedicated channels for joint recovery at the receiver. We propose a novel distributed nonlinear transform source-channel coding (D-NTSCC) framework. Unlike existing learning-based approaches that implicitly learn source correlation in a purely data-driven manner, our method explicitly models the source correlation through joint distribution. Specifically, the correlated images are separately encoded into latent representations via an encoding transform function, followed by a JSCC encoder to produce channel input symbols. A learned joint entropy model is introduced to determine the transmission rates, which more accurately approximates the joint distribution of the latent representations and captures source dependencies, thereby improving rate-distortion performance. At the receiver, a JSCC decoder and a decoding transform function reconstruct the images from the received signals, each serving as side information for recovering the other image. Therein, a transformation module is designed to align the latent representations for maximal correlation learning. Furthermore, a loss function is derived to jointly optimize encoding, decoding, and the joint entropy model, ensuring that the learned joint entropy model approximates the true joint distribution. Experiments on multi-view datasets show that D-NTSCC outperforms state-of-the-art distributed schemes, demonstrating its effectiveness in exploiting source correlation.

Paper Structure

This paper contains 20 sections, 1 theorem, 12 equations, 4 figures, 1 table.

Key Result

Proposition 1

The expectation of the KL divergence between $q(\mathbf{\hat{s}}_1,\mathbf{\hat{s}}_2,\mathbf{\tilde{z}}_1,\mathbf{\tilde{z}}_2|\mathbf{x}_1,\mathbf{x}_2)$ and $p(\mathbf{\hat{s}}_1,\mathbf{\hat{s}}_2,\mathbf{\tilde{z}}_1,\mathbf{\tilde{z}}_2|\mathbf{x}_1,\mathbf{x}_2)$ over the data distribution $p

Figures (4)

  • Figure 1: Overall architecture of the proposed D-NTSCC scheme.
  • Figure 2: NN architectures of nonlinear transform modules.
  • Figure 3: NN architectures of different modules.
  • Figure 4: Performances at varying transmission rates when channel SNR = 5 dB.

Theorems & Definitions (1)

  • Proposition 1