Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

Sadaf Salehkalaibar; Jun Chen; Ashish Khisti; Wei Yu

Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

Sadaf Salehkalaibar, Jun Chen, Ashish Khisti, Wei Yu

TL;DR

This paper addresses the rate-distortion-perception (RDP) tradeoff for memoryless sources under a perception measure defined by the conditional distribution of the source given the encoder output, focusing on the no-shared-randomness setting. It derives a single-letter characterization of the RDP function for finite alphabets and extends it to continuous alphabets with squared-error distortion and squared Wasserstein perception, showing the decoder can be implemented as a noise-adding operation on the MMSE estimate. Key contributions include the equality $R_{ ext{C}}(D,P)=R(D,P)$ for discrete sources, a closed-form Bernoulli result with an envelope correction, a continuous-alphabet equivalence with a practical representation via $U'= ext{E}[X|U]$, and a Gaussian-vector reverse-waterfilling solution; Gaussian mixtures receive partial treatment. The results provide a tractable framework for designing perceptually realistic compression without shared randomness and connect to existing marginal-perception findings, with implications for Gaussian-source coding and potential extensions to networks and neural compression systems.

Abstract

This paper studies the rate-distortion-perception (RDP) tradeoff for a memoryless source model in the asymptotic limit of large block-lengths. The perception measure is based on a divergence between the distributions of the source and reconstruction sequences \emph{conditioned} on the encoder output, first proposed by Mentzer et al. We consider the case when there is no shared randomness between the encoder and the decoder and derive a single-letter characterization of the RDP function for the case of discrete memoryless sources. This is in contrast to the marginal-distribution metric case (introduced by Blau and Michaeli), whose RDP characterization remains open when there is no shared randomness. The achievability scheme is based on lossy source coding with a posterior reference map. For the case of continuous valued sources under the squared error distortion measure and the squared quadratic Wasserstein perception measure, we also derive a single-letter characterization and show that the decoder can be restricted to a noise-adding mechanism. Interestingly, the RDP function characterized for the case of zero perception loss coincides with that of the marginal metric, and further zero perception loss can be achieved with a 3-dB penalty in minimum distortion. Finally we specialize to the case of Gaussian sources, and derive the RDP function for Gaussian vector case and propose a reverse water-filling type solution. We also partially characterize the RDP function for a mixture of Gaussian vector sources.

Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

TL;DR

for discrete sources, a closed-form Bernoulli result with an envelope correction, a continuous-alphabet equivalence with a practical representation via

, and a Gaussian-vector reverse-waterfilling solution; Gaussian mixtures receive partial treatment. The results provide a tractable framework for designing perceptually realistic compression without shared randomness and connect to existing marginal-perception findings, with implications for Gaussian-source coding and potential extensions to networks and neural compression systems.

Abstract

Paper Structure (22 sections, 15 theorems, 162 equations, 2 figures)

This paper contains 22 sections, 15 theorems, 162 equations, 2 figures.

Introduction
Problem Formulation
System Model
Rate-Distortion-Perception Function
Finite Alphabet Sources
Continous Alphabet Sources
Conclusion
Proof of Proposition \ref{['prop:property']}
Proof of Proposition \ref{['prop:convexity']}
Proof of Theorem \ref{['thm:RDP']}
Proof of Theorem \ref{['thm:binary']}
On the Concavity of $\hbar$
Proof of Theorem \ref{['thm:continuous']}
Proof of (\ref{['eq:continuousRDP']})
$D>0$ and $P=0$
...and 7 more sections

Key Result

Proposition 1

$\phi$ defined in (eq:divergence) has the following properties.

Figures (2)

Figure 1: System model with the perception measure based on the conditional distribution.
Figure 2: Reverse water-filling solution for the Gaussian vector source.

Theorems & Definitions (26)

Proposition 1
Definition 1
Proposition 2
Proposition 3
Theorem 1
Remark 1
Theorem 2
Remark 2
Remark 3
Theorem 3
...and 16 more

Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

TL;DR

Abstract

Rate-Distortion-Perception Tradeoff Based on the Conditional-Distribution Perception Measure

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (26)