Data Compression with Relative Entropy Coding
Gergely Flamich
TL;DR
The thesis studies Data Compression with Relative Entropy Coding, a framework that generalizes classical source coding to uncertain, randomized information and continuous spaces, enabling privacy and perceptual considerations in ML-based compression. It develops tight fundamental limits via the channel simulation divergence, and introduces fast, Poisson-process–based samplers (rejection sampling, A* sampling, Greedy Poisson Rejection Sampling) that approach these limits while enabling parallelism and approximate schemes. It also introduces COMBINER, a practically efficient compression scheme using Bayesian implicit neural representations, demonstrating high performance on image, video, audio, and protein data with low energy and small models. The work bridges theory and practice by providing constructive algorithms with provable guarantees, practical implementation strategies, and clear guidelines for deployment in real-world ML pipelines. Overall, the combination of rigorous limits, fast sampler constructions, and energy-efficient compression architectures positions relative entropy coding as a viable, scalable alternative to conventional quantization-based methods for modern data compression tasks.
Abstract
Over the last few years, machine learning unlocked previously infeasible features for compression, such as providing guarantees for users' privacy or tailoring compression to specific data statistics (e.g., satellite images or audio recordings of animals) or users' audiovisual perception. This, in turn, has led to an explosion of theoretical investigations and insights that aim to develop new fundamental theories, methods and algorithms better suited for machine learning-based compressors. In this thesis, I contribute to this trend by investigating relative entropy coding, a mathematical framework that generalises classical source coding theory. Concretely, relative entropy coding deals with the efficient communication of uncertain or randomised information. One of its key advantages is that it extends compression methods to continuous spaces and can thus be integrated more seamlessly into modern machine learning pipelines than classical quantisation-based approaches. Furthermore, it is a natural foundation for developing advanced compression methods that are privacy-preserving or account for the perceptual quality of the reconstructed data. The thesis considers relative entropy coding at three conceptual levels: After introducing the basics of the framework, (1) I prove results that provide new, maximally tight fundamental limits to the communication and computational efficiency of relative entropy coding; (2) I use the theory of Poisson point processes to develop and analyse new relative entropy coding algorithms, whose performance attains the theoretic optima and (3) I showcase the strong practical performance of relative entropy coding by applying it to image, audio, video and protein data compression using small, energy-efficient, probabilistic neural networks called Bayesian implicit neural representations.
