Table of Contents
Fetching ...

On the Rate-Distortion-Perception Function for Gaussian Processes

Giuseppe Serra, Photios A. Stavrou, Marios Kountouris

TL;DR

A tight analytical upper bound on the RDPF for GPs is derived, which recovers the optimal solution in the “perfect realism” regime.

Abstract

In this paper, we investigate the rate-distortion-perception function (RDPF) of a source modeled by a Gaussian Process (GP) on a measure space $Ω$ under mean squared error (MSE) distortion and squared Wasserstein-2 perception metrics. First, we show that the optimal reconstruction process is itself a GP, characterized by a covariance operator sharing the same set of eigenvectors of the source covariance operator. Similarly to the classical rate-distortion function, this allows us to formulate the RDPF problem in terms of the Karhunen-Loève transform coefficients of the involved GPs. Leveraging the similarities with the finite-dimensional Gaussian RDPF, we formulate an analytical tight upper bound for the RDPF for GPs, which recovers the optimal solution in the "perfect realism" regime. Lastly, in the case where the source is a stationary GP and $Ω$ is the interval $[0, T]$ equipped with the Lebesgue measure, we derive an upper bound on the rate and the distortion for a fixed perceptual level and $T \to \infty$ as a function of the spectral density of the source process.

On the Rate-Distortion-Perception Function for Gaussian Processes

TL;DR

A tight analytical upper bound on the RDPF for GPs is derived, which recovers the optimal solution in the “perfect realism” regime.

Abstract

In this paper, we investigate the rate-distortion-perception function (RDPF) of a source modeled by a Gaussian Process (GP) on a measure space under mean squared error (MSE) distortion and squared Wasserstein-2 perception metrics. First, we show that the optimal reconstruction process is itself a GP, characterized by a covariance operator sharing the same set of eigenvectors of the source covariance operator. Similarly to the classical rate-distortion function, this allows us to formulate the RDPF problem in terms of the Karhunen-Loève transform coefficients of the involved GPs. Leveraging the similarities with the finite-dimensional Gaussian RDPF, we formulate an analytical tight upper bound for the RDPF for GPs, which recovers the optimal solution in the "perfect realism" regime. Lastly, in the case where the source is a stationary GP and is the interval equipped with the Lebesgue measure, we derive an upper bound on the rate and the distortion for a fixed perceptual level and as a function of the spectral density of the source process.
Paper Structure (12 sections, 22 equations, 2 figures)

This paper contains 12 sections, 22 equations, 2 figures.

Figures (2)

  • Figure 1: Source Power Spectrum $S_X(f)$ vs. per-frequency distortion $\hat{D}(S_X(f))$ for $\gamma = 0.7$ and varying $\alpha$.
  • Figure 2: (a) Rate $\tilde{R}$ and (b) distortion $\tilde{D}$ curves parameterized by $(\alpha, \gamma)$.