Data Compression with Stochastic Codes
Gergely Flamich, Deniz Gündüz
TL;DR
This work surveys relative entropy coding (REC), a stochastic-code framework that extends lossy data compression beyond fixed quantisers by allowing arbitrary reconstruction distributions $P_{at X|X}$ and leveraging shared randomness. It organizes a spectrum of constructions—from rejection sampling and Poisson functional representations to dithered quantisers and selection samplers—while analysing runtime bounds and practical constraints, including the necessity of synchronization. The paper demonstrates REC's potential through applications in learned compression, realism-aware coding, privacy-preserving mechanisms, and communication-efficient reinforcement learning, and discusses extensions to multidimensional data, channel-code-based REC, and advanced sampling techniques. It also candidly discusses limitations, notably speed and synchronization challenges, and positions REC as a promising direction for end-to-end optimization and rate-distortion-realism trade-offs, pending advances in fast, robust implementations.
Abstract
Machine learning has had a major impact on data compression over the last decade and inspired many new, exciting theoretical and applied questions. This paper describes one such direction -- relative entropy coding -- which focuses on constructing stochastic codes, primarily as an alternative to quantisation and entropy coding in lossy source coding. Our primary aim is to provide a broad overview of the topic, with an emphasis on the computational and practical aspects currently missing from the literature. Our goal is threefold: for the curious reader, we aim to provide an intuitive picture of the field and convince them that relative entropy coding is a simple yet exciting emerging field in data compression research. For a reader interested in applied research on lossy data compression, we provide an account of the most salient contemporary applications. Finally, for the reader who has heard of relative entropy coding but has never been quite sure what it is or how the algorithms fit together, we hope to illustrate how simple and elegant the underlying constructions are.
