Classification-Oriented Semantic Wireless Communications
Emrecan Kutay, Aylin Yener
TL;DR
The paper addresses the challenge of transmitting semantic meaning for classification tasks over wireless channels under latency and data constraints. It compares memory-based semantic processing (semantic quantization and semantic compression) with a learning-based vector quantized autoencoder (VQ-VAE) approach, both built on pre-trained semantic extractors (SBERT/CLIP), and introduces system time efficiency $\eta_T(\tilde{T})$ to quantify time saved by avoiding training. Key findings show memory-based methods achieve higher system time efficiency and comparable accuracy to semantic-agnostic baselines, while the learning-based VQ-AE is more sensitive to training data, with results validated on text (AG's News) and image (STL-10) modalities over AWGN and Rayleigh fading channels. The work demonstrates that leveraging past realizations as codebooks enables fast, interpretable, and resource-efficient semantic communications suitable for dynamic wireless environments, with potential benefits for 6G-era task-oriented links.
Abstract
We propose semantic communication over wireless channels for various modalities, e.g., text and images, in a task-oriented communications setup where the task is classification. We present two approaches based on memory and learning. Both approaches rely on a pre-trained neural network to extract semantic information but differ in codebook construction. In the memory-based approach, we use semantic quantization and compression models, leveraging past source realizations as a codebook to eliminate the need for further training. In the learning-based approach, we use a semantic vector quantized autoencoder model that learns a codebook from scratch. Both are followed by a channel coder in order to reliably convey semantic information to the receiver (classifier) through the wireless medium. In addition to classification accuracy, we define system time efficiency as a new performance metric. Our results demonstrate that the proposed memory-based approach outperforms its learning-based counterpart with respect to system time efficiency while offering comparable accuracy to semantic agnostic conventional baselines.
