Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels
Noel Teku, Sudarshan Adiga, Ravi Tandon
TL;DR
This work addresses the problem of transmitting classifier decisions, represented as a probability vector $\mathbf{p}$, over noisy channels under a latency constraint. It introduces a latency-distortion framework based on $f$-divergence, and analyzes three quantization schemes—Uniform (UQ), Lattice (LQ), and Sparse Lattice (SLQ)—to bound source distortion and bit budgets. By combining these bounds with finite-blocklength channel coding results, the authors derive end-to-end latency formulas and optimize over the source distortion $\beta_s$ to meet a total distortion target $\beta_t$, including extensions to fading channels with/without CSI. Experiments on AWGN and Rayleigh fading channels, using datasets like CIFAR-100 and Imagenet-1K, show that SLQ consistently minimizes latency, especially for high-dimensional probability vectors, achieving up to large fractional reductions compared to UQ/LQ. The findings underscore the importance of joint source-channel design for low-latency semantic communications in ML-enabled systems.
Abstract
In this work, the problem of communicating decisions of a classifier over a noisy channel is considered. With machine learning based models being used in variety of time-sensitive applications, transmission of these decisions in a reliable and timely manner is of significant importance. To this end, we study the scenario where a probability vector (representing the decisions of a classifier) at the transmitter, needs to be transmitted over a noisy channel. Assuming that the distortion between the original probability vector and the reconstructed one at the receiver is measured via f-divergence, we study the trade-off between transmission latency and the distortion. We completely analyze this trade-off using uniform, lattice, and sparse lattice-based quantization techniques to encode the probability vector by first characterizing bit budgets for each technique given a requirement on the allowed source distortion. These bounds are then combined with results from finite-blocklength literature to provide a framework for analyzing the effects of both quantization distortion and distortion due to decoding error probability (i.e., channel effects) on the incurred transmission latency. Our results show that there is an interesting interplay between source distortion (i.e., distortion for the probability vector measured via f-divergence) and the subsequent channel encoding/decoding parameters; and indicate that a joint design of these parameters is crucial to navigate the latency-distortion tradeoff. We study the impact of changing different parameters (e.g. number of classes, SNR, source distortion) on the latency-distortion tradeoff and perform experiments on AWGN and fading channels. Our results indicate that sparse lattice-based quantization is the most effective at minimizing latency across various regimes and for sparse, high-dimensional probability vectors (i.e., high number of classes).
