A Theory of Universal Rate-Distortion-Classification Representations for Lossy Compression
Nam Nguyen, Thinh Nguyen, Bella Bose
TL;DR
The paper addresses multi-objective lossy compression by extending RD theory with perception and classification, proposing universal representations that fix the encoder and reuse decoders to achieve diverse distortion-classification tradeoffs. It proves that for Gaussian sources under MSE, a single fixed encoder can realize the entire distortion-classification region without rate penalty, and it provides a generalized characterization for arbitrary sources using MMSE and Wasserstein concepts, including an asymptotic equivalence R^(∞)(Θ) = R(Θ). Empirically, universal encoders trained with Wasserstein regularization perform comparably to task-specific models on MNIST and SVHN, validating practicality for multi-task compression. The results offer a scalable approach to multi-objective compression where updating decoders suffices to meet varying downstream requirements, reducing design and training burden while preserving performance. These findings advance the deployment of versatile, task-aware compression systems in real-world applications with strict perceptual and analytical requirements.
Abstract
In lossy compression, Blau and Michaeli [5] introduced the information rate-distortion-perception (RDP) function, extending traditional rate-distortion theory by incorporating perceptual quality. More recently, this framework was expanded by defining the rate-distortion-perception-classification (RDPC) function, integrating multi-task learning that jointly optimizes generative tasks such as perceptual quality and classification accuracy alongside reconstruction tasks [28]. To that end, motivated by the concept of a universal RDP encoder introduced in [34], we investigate universal representations that enable diverse distortion-classification tradeoffs through a single fixed encoder combined with multiple decoders. Specifically, theoretical analysis and numerical experiment demonstrate that for the Gaussian source under mean squared error (MSE) distortion, the entire distortion-classification tradeoff region can be achieved using one universal encoder. In addition, this paper characterizes achievable distortion-classification regions for fixed universal representations in general source distributions, identifying conditions that ensure minimal distortion penalty when reusing encoders across varying tradeoff points. Experimental results using MNIST and SVHN datasets validate our theoretical insights, showing that universal encoders can obtain distortion performance comparable to task-specific encoders, thus supporting the practicality and effectiveness of our proposed universal representations.
