Deep Semantic Inference over the Air: An Efficient Task-Oriented Communication System
Chenyang Wang, Roger Olsson, Stefan Forsström, Qing He
TL;DR
This paper tackles efficient wireless classification under latency and resource constraints by proposing a deep learning–driven task-oriented communication framework that partitions neural networks between edge and cloud, and compresses intermediate semantic features. It uses ResNet-18/34 backbones on CIFAR-10/100 with AWGN channel simulation to study how partition location and semantic dimension affect accuracy and latency, revealing that substantial computational and communicational savings can be achieved while preserving a large portion of baseline performance. Key contributions include a tunable latency model, an analysis of early-to-late split points, and practical guidance on balancing edge computation with channel transmission, along with a public codebase. The findings have practical implications for edge–cloud inference in bandwidth- and energy-constrained wireless environments, and pave the way for dynamic, adaptive semantic communication in real-world deployments.
Abstract
Empowered by deep learning, semantic communication marks a paradigm shift from transmitting raw data to conveying task-relevant meaning, enabling more efficient and intelligent wireless systems. In this study, we explore a deep learning-based task-oriented communication framework that jointly considers classification performance, computational latency, and communication cost. We evaluate ResNets-based models on the CIFAR-10 and CIFAR-100 datasets to simulate real-world classification tasks in wireless environments. We partition the model at various points to simulate split inference across a wireless channel. By varying the split location and the size of the transmitted semantic feature vector, we systematically analyze the trade-offs between task accuracy and resource efficiency. Experimental results show that, with appropriate model partitioning and semantic feature compression, the system can retain over 85\% of baseline accuracy while significantly reducing both computational load and communication overhead.
