Low-Rate, Low-Distortion Compression with Wasserstein Distortion
Yang Qiu, Aaron B. Wagner
TL;DR
This work introduces Wasserstein distortion as a unified fidelity-realism measure for rate-distortion theory, using a pooling width $\sigma$ to interpolate between pixel-level fidelity and ensemble realism. It proves continuity results showing $D$ converges to fidelity in the $\sigma\to0$ limit and to realism in the $\sigma\to\infty$ limit for ergodic sources, and analyzes the large-$\sigma$ regime to derive two practical achievability schemes and a converse. The two schemes—Independent Realization and Random Permutation—establish scalable $R$ and $D$ tradeoffs with explicit $\sigma$-dependent rates, leading to an asymptotic region for $(\alpha,\beta)$ that characterizes the vanishing-rate, vanishing-distortion behavior. A sandwich argument ties the analysis together by bounding the distortion between tractable proxy distortions, enabling robust transfer of results to general Wasserstein distortions. Overall, the paper lays theoretical foundations for low-rate, low-distortion compression under a perceptually informed distortion metric, with implications for texture-like and human-visual-system-inspired coding.
Abstract
Wasserstein distortion is a one-parameter family of distortion measures that was recently proposed to unify fidelity and realism constraints. After establishing continuity results for Wasserstein in the extreme cases of pure fidelity and pure realism, we prove the first coding theorems for compression under Wasserstein distortion focusing on the regime in which both the rate and the distortion are small.
