Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

Loc X. Nguyen; Kitae Kim; Ye Lin Tun; Sheikh Salman Hassan; Yan Kyaw Tun; Zhu Han; Choong Seon Hong

Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

Loc X. Nguyen, Kitae Kim, Ye Lin Tun, Sheikh Salman Hassan, Yan Kyaw Tun, Zhu Han, Choong Seon Hong

TL;DR

The paper tackles multi-user semantic communication in a downlink setting where users have heterogeneous computing resources. It adopts a single base station encoder with Swin-transformer–based decoders at two users and introduces a two-stage training regimen that first trains with the high-capacity decoder, then freezes the encoder to train the low-capacity decoder, enhancing stability and speed. Two enhancements—partially transferring weights from the high- to low-capacity decoder and knowledge distillation from the high-capacity teacher to the low-capacity student—are proposed and evaluated. Experiments on the DIV2K dataset show improved reconstruction quality (PSNR) across SNRs and faster convergence, demonstrating scalable, resource-aware semantic communication for multi-user downlink networks. The practical impact lies in enabling efficient, high-quality semantic transmissions to devices with varying computing capabilities without requiring multiple encoders at the transmitter.

Abstract

Semantic communication, notable for ensuring quality of service by jointly optimizing source and channel coding, effectively extracts data semantics, reduces transmission length, and mitigates channel noise. However, most studies overlook multi-user scenarios and resource availability, limiting real-world application. This paper addresses this gap by focusing on downlink communication from a base station to multiple users with varying computing capacities. Users employ variants of Swin transformer models for source decoding and a simple architecture for channel decoding. We propose a novel training regimen, incorporating transfer learning and knowledge distillation to improve low-computing users' performance. Extensive simulations validate the proposed methods.

Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

TL;DR

Abstract

Paper Structure (12 sections, 5 equations, 5 figures, 1 algorithm)

This paper contains 12 sections, 5 equations, 5 figures, 1 algorithm.

Introduction
System Model
Multi-users in Semantic Communication
Proposed Training Procedure
Partially Transfer Learning between Users
Knowledge Distillation from the Teacher to the Student
Simulation Results
Model and Dataset
Benchmark
Evaluating Convergence Speed of the Proposed Procedure for Low-computing Decoder
Partially Transfer Learning and Knowledge Distillation Evaluation
Conclusion

Figures (5)

Figure 1: The system model includes one transmitter and two receivers. The difference in the number of Swin-Transformer blocks leads to a difference in computing capacities between receivers.
Figure 2: The PSNR values of the proposed procedure, iterative training, and the train alone under SNR = 3dB.
Figure 3: The effects of transfer learning and knowledge distillation on the performance of LCD under SNR = 3dB.
Figure 4: The PSNR values of LCD after 500 epochs under different channel conditions.
Figure 5: The image reconstructed by different training methods under the SNR = 3dB.

Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

TL;DR

Abstract

Optimizing Multi-User Semantic Communication via Transfer Learning and Knowledge Distillation

Authors

TL;DR

Abstract

Table of Contents

Figures (5)