The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs
Junli Fang, João F. C. Mota, Baoshan Lu, Weicheng Zhang, Xuemin Hong
TL;DR
This work introduces the rate-distortion-perception-classification (RDPC) tradeoff in joint source coding and modulation (JSCM), showing that minimizing the channel rate under distortion, perception, and classification constraints yields a strict, convex tradeoff. It provides two complementary solutions: RDPCO, a heuristic optimizer that operates under Gaussian mixture model (GMM) assumptions with linear encoders/decoders, and ID-GAN, an inverse-domain GAN framework that learns encoders/decoders end-to-end to balance reconstruction, perceptual quality, and semantic classification under channel noise. Theoretical results include a tight bound on the RDPC function under GMM assumptions and the convexity of R(D,P,C). Experiments exhibit the RDPC tradeoff and show better perceptual and semantic performance than separation-based and existing deep JSCM methods on one or more of these metrics. Together, the methods enable extreme compression while preserving perceptual quality and classification accuracy, offering practical guidance for robust, task-aware communications and semantic transmission.
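As a rough illustration of the kind of constrained problem the summary describes, the RDPC function can be sketched in the following generic form. The symbols here are assumptions made for illustration and may differ from the paper's notation: \Delta is a distortion measure, d is a divergence between the source and reconstruction distributions, and \epsilon is a classification error.

```latex
\begin{aligned}
R(D, P, C) \;=\; \min_{\text{encoder},\ \text{decoder}} \quad & \text{channel rate} \\
\text{s.t.} \quad & \mathbb{E}\big[\Delta(X, \hat{X})\big] \le D, \\
                  & d\big(p_X,\, p_{\hat{X}}\big) \le P, \\
                  & \epsilon(\hat{X}) \le C .
\end{aligned}
% Convexity of R means that, for any two constraint triples and any \lambda \in [0, 1]:
% R(\lambda D_1 + (1-\lambda) D_2,\ \lambda P_1 + (1-\lambda) P_2,\ \lambda C_1 + (1-\lambda) C_2)
%   \le \lambda R(D_1, P_1, C_1) + (1-\lambda) R(D_2, P_2, C_2).
```

Read this way, tightening any one of the three constraints can only keep the minimum rate the same or increase it, which is the tradeoff the summary refers to.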
Abstract
The joint source-channel coding (JSCC) framework uses deep learning to learn the best source and channel codes from data. When the output signal, rather than being binary, is mapped directly onto the IQ domain (complex-valued), we call the resulting framework joint source coding and modulation (JSCM). We consider a JSCM scenario and show the existence of a strict tradeoff between channel rate, distortion, perception, and classification accuracy, a tradeoff that we name RDPC. We then propose two image compression methods to navigate that tradeoff: the RDPCO algorithm, which, under simple assumptions, directly solves the optimization problem characterizing the tradeoff, and an algorithm based on an inverse-domain generative adversarial network (ID-GAN), which is more general and achieves extreme compression. Simulation results corroborate the theoretical findings, showing that both algorithms exhibit the RDPC tradeoff. They also demonstrate that the proposed ID-GAN algorithm effectively balances image distortion, perception, and classification accuracy, and significantly outperforms traditional separation-based methods and recent deep JSCM architectures in terms of one or more of these metrics.
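To make the JSCM idea of mapping an encoder output directly onto the IQ domain more concrete, the following minimal sketch pairs the entries of a real-valued latent vector into complex symbols and passes them through a noisy channel. It is not the paper's implementation: the function names `to_iq_symbols` and `awgn_channel`, the unit-power normalization, and the additive white Gaussian noise model are all assumptions chosen for illustration.

```python
import math
import random


def to_iq_symbols(latent):
    """Pair consecutive real values into complex IQ symbols.

    Hypothetical helper: the symbols are rescaled to unit average power,
    which is an assumed transmit-power constraint.
    """
    if len(latent) % 2 != 0:
        raise ValueError("latent length must be even")
    # Even-indexed entries become the in-phase (I) part, odd-indexed the quadrature (Q) part.
    symbols = [complex(latent[i], latent[i + 1]) for i in range(0, len(latent), 2)]
    avg_power = sum(abs(s) ** 2 for s in symbols) / len(symbols)
    if avg_power == 0:
        return symbols
    scale = 1.0 / math.sqrt(avg_power)
    return [s * scale for s in symbols]


def awgn_channel(symbols, snr_db, rng):
    """Add circularly symmetric complex Gaussian noise (assumed channel model).

    Assumes unit average signal power, so the noise power is 10^(-snr_db / 10).
    """
    noise_power = 10 ** (-snr_db / 10)
    sigma = math.sqrt(noise_power / 2)  # standard deviation per real dimension
    return [s + complex(rng.gauss(0, sigma), rng.gauss(0, sigma)) for s in symbols]


rng = random.Random(0)            # fixed seed for repeatability
latent = [0.5, -1.0, 2.0, 0.25]   # stand-in for an encoder output
tx = to_iq_symbols(latent)        # two complex symbols, i.e. two channel uses
rx = awgn_channel(tx, snr_db=10.0, rng=rng)
```

In this sketch, a latent vector of length 2k occupies k complex channel uses, so a shorter latent corresponds to a lower channel rate.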
