Channel Simulation and Distributed Compression with Ensemble Rejection Sampling
Buu Phan, Ashish Khisti
TL;DR
This work studies channel simulation and distributed matching through the Ensemble Rejection Sampling (ERS) framework. ERS combines rejection sampling with importance sampling to achieve near-optimal coding costs for channel simulation while boosting distributed matching probabilities to approach Poisson Matching Lemma (PML) performance, even when the decoder learns the target distribution (e.g., via machine learning). The authors establish concrete coding bounds for RS and ERS, quantify matching probabilities with and without batch communication, and extend these results to lossy compression with side information (Wyner-Ziv). Empirical results on synthetic Gaussian sources and MNIST (and CIFAR-10 variants) demonstrate that ERS yields competitive rate-distortion performance and unbiased sampling, illustrating practical applicability in distributed compression and learning-driven settings.
Abstract
We study channel simulation and distributed matching, two fundamental problems with several applications to machine learning, using a recently introduced generalization of the standard rejection sampling (RS) algorithm known as Ensemble Rejection Sampling (ERS). For channel simulation, we propose a new coding scheme based on ERS that achieves a near-optimal coding rate. In this process, we demonstrate that standard RS can also achieve a near-optimal coding rate and generalize the result of Braverman and Garg (2014) to the continuous alphabet setting. Next, as our main contribution, we present a distributed matching lemma for ERS, which serves as the rejection sampling counterpart to the Poisson Matching Lemma (PML) introduced by Li and Anantharam (2021). Our result also generalizes a recent work on importance matching lemma (Phan et al, 2024) and, to our knowledge, is the first result on distributed matching in the family of rejection sampling schemes where the matching probability is close to PML. We demonstrate the practical significance of our approach over prior works by applying it to distributed compression. The effectiveness of our proposed scheme is validated through experiments involving synthetic Gaussian sources and distributed image compression using the MNIST dataset.
