Optimal Neural Compressors for the Rate-Distortion-Perception Tradeoff
Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti
TL;DR
This work addresses the neural rate-distortion-perception (RDP) tradeoff by proposing lattice transform coding (LTC) based compressors that incorporate dithering under varying shared randomness budgets. The authors introduce SD-LTC (infinite shared randomness), PD-LTC (no shared randomness), and QSD-LTC (finite shared randomness via nested lattices), and prove that SD-LTC achieves the RDP function $R(D,P)$ for Gaussian sources while PD-LTC achieves $R(D/2,\infty)$ at $P=0$; QSD-LTC provides a practical continuum between these regimes. Empirical results on synthetic Gaussian data and real-world datasets (e.g., MNIST, Physics, Speech) show that increasing shared randomness and lattice packing efficiency improves the RDP performance, with QSD-LTC closely approaching SD-LTC for modest randomness budgets. The findings offer a scalable, low-complexity path to RDP-optimal neural compressors and highlight the tradeoffs between randomness, lattice quality, and computational efficiency for perceptually aware compression.
Abstract
Recent efforts in neural compression have focused on the rate-distortion-perception (RDP) tradeoff, where the perception constraint ensures the source and reconstruction distributions are close in terms of a statistical divergence. Theoretical work on RDP describes properties of RDP-optimal compressors without providing constructive and low complexity solutions. While classical rate distortion theory shows that optimal compressors should efficiently pack space, RDP theory additionally shows that infinite randomness shared between the encoder and decoder may be necessary for RDP optimality. In this paper, we propose neural compressors that are low complexity and benefit from high packing efficiency through lattice coding and shared randomness through shared dithering over the lattice cells. For two important settings, namely infinite shared and zero shared randomness, we analyze the RDP tradeoff achieved by our proposed neural compressors and show optimality in both cases. Experimentally, we investigate the roles that these two components of our design, lattice coding and randomness, play in the performance of neural compressors on synthetic and real-world data. We observe that performance improves with more shared randomness and better lattice packing.
