Characterization of the Distortion-Perception Tradeoff for Finite Channels with Arbitrary Metrics
Dror Freirich, Nir Weinberger, Ron Meir
TL;DR
We address the distortion-perception tradeoff in finite-alphabet channels by formulating the problem with a Wasserstein-1 perception index and a general distortion matrix. The DP function $D(P)$ is shown to be computable via linear programming and, in the discrete setting, is necessarily piecewise linear with a finite set of breakpoints. A dual OT-based characterization yields a structural understanding: the DP curve is the upper envelope of a finite family of linear functions, with breakpoints tied to dual-vertex projections. For binary sources we derive a closed-form expression, revealing explicit breakpoints and linear segments, enabling exact, efficient computation of the DP curve. These results unify and extend prior work on rate-distortion-perception and discrete DP, with practical implications for constructing perceptually constrained reconstructions via simple breakpoint-based strategies.
Abstract
Whenever inspected by humans, reconstructed signals should not be distinguished from real ones. Typically, such a high perceptual quality comes at the price of high reconstruction error, and vice versa. We study this distortion-perception (DP) tradeoff over finite-alphabet channels, for the Wasserstein-$1$ distance induced by a general metric as the perception index, and an arbitrary distortion matrix. Under this setting, we show that computing the DP function and the optimal reconstructions is equivalent to solving a set of linear programming problems. We provide a structural characterization of the DP tradeoff, where the DP function is piecewise linear in the perception index. We further derive a closed-form expression for the case of binary sources.
