Some Notes on the Sample Complexity of Approximate Channel Simulation

Gergely Flamich; Lennie Wells

Some Notes on the Sample Complexity of Approximate Channel Simulation

Gergely Flamich, Lennie Wells

TL;DR

The paper analyzes the sample complexity of approximate one-shot channel simulation (OSCS). It proves a super-polynomial runtime lower bound in $D_{\infty}[Q||P]$ for general approximate schemes, implying hardness of approximate sampling in the worst case. It then identifies a favorable regime: with access to the unnormalized density ratio $r \propto dQ/dP$ and knowledge of $D_{KL}[Q||P]$, depth-limited A* coding yields $D_{TV}[Q||P] \le \epsilon$ with sample complexity $\exp_2\big((D_{KL}[Q||P] + o(1))/\epsilon\big)$, enabling near-optimal encoding rates. To push practical applicability, the authors refine the rejection-sampling approach via a better target $Q_M$ and show that depth-limited A* (or equivalent) can achieve similar TV guarantees with favorable dependence on the $f$-divergence and $D_{KL}$, avoiding dependence on $\|r\|_\infty$. Finally, they adapt these ideas to OSCS by replacing rejection sampling with depth-limited A* coding, achieving near-optimal encoding lengths and providing a roadmap for realism-constrained lossy compression pipelines that leverage structure in density ratios and KL divergence.

Abstract

Channel simulation algorithms can efficiently encode random samples from a prescribed target distribution $Q$ and find applications in machine learning-based lossy data compression. However, algorithms that encode exact samples usually have random runtime, limiting their applicability when a consistent encoding time is desirable. Thus, this paper considers approximate schemes with a fixed runtime instead. First, we strengthen a result of Agustsson and Theis and show that there is a class of pairs of target distribution $Q$ and coding distribution $P$, for which the runtime of any approximate scheme scales at least super-polynomially in $D_\infty[Q \Vert P]$. We then show, by contrast, that if we have access to an unnormalised Radon-Nikodym derivative $r \propto dQ/dP$ and knowledge of $D_{KL}[Q \Vert P]$, we can exploit global-bound, depth-limited A* coding to ensure $\mathrm{TV}[Q \Vert P] \leq ε$ and maintain optimal coding performance with a sample complexity of only $\exp_2\big((D_{KL}[Q \Vert P] + o(1)) \big/ ε\big)$.

Some Notes on the Sample Complexity of Approximate Channel Simulation

TL;DR

The paper analyzes the sample complexity of approximate one-shot channel simulation (OSCS). It proves a super-polynomial runtime lower bound in

for general approximate schemes, implying hardness of approximate sampling in the worst case. It then identifies a favorable regime: with access to the unnormalized density ratio

and knowledge of

, depth-limited A* coding yields

with sample complexity

, enabling near-optimal encoding rates. To push practical applicability, the authors refine the rejection-sampling approach via a better target

and show that depth-limited A* (or equivalent) can achieve similar TV guarantees with favorable dependence on the

-divergence and

, avoiding dependence on

. Finally, they adapt these ideas to OSCS by replacing rejection sampling with depth-limited A* coding, achieving near-optimal encoding lengths and providing a roadmap for realism-constrained lossy compression pipelines that leverage structure in density ratios and KL divergence.

Abstract

Channel simulation algorithms can efficiently encode random samples from a prescribed target distribution

and find applications in machine learning-based lossy data compression. However, algorithms that encode exact samples usually have random runtime, limiting their applicability when a consistent encoding time is desirable. Thus, this paper considers approximate schemes with a fixed runtime instead. First, we strengthen a result of Agustsson and Theis and show that there is a class of pairs of target distribution

and coding distribution

, for which the runtime of any approximate scheme scales at least super-polynomially in

. We then show, by contrast, that if we have access to an unnormalised Radon-Nikodym derivative

and knowledge of

, we can exploit global-bound, depth-limited A* coding to ensure

and maintain optimal coding performance with a sample complexity of only

Paper Structure (10 sections, 5 theorems, 50 equations, 1 algorithm)

This paper contains 10 sections, 5 theorems, 50 equations, 1 algorithm.

Introduction
Background
Why characterise approximate channel simulation using total variation?
Improving a result of Agustsson and Theis
Improving the scheme of Block and Polyanskiy
Approximate Channel Simulation Using the Improved Scheme
Discussion and Future Work
Approximate target distribution: properties and optimality
Discussion of importance sampling bounds of Chatterjee and Diaconis
Lower bound on the entropy of the Block-Polyanskiy rejection sampler

Key Result

Theorem 4.1

Consider an algorithm which receives the parameters of an arbitrary RBM $Q$ of problem size $M$ as input and has access to an unlimited number of i.i.d. random variables $Z_n\! \sim\! P$, where $P$ is the uniform measure over $\{0,1\}^M$. It outputs $\widetilde{Z}\! \sim\! \widetilde{Q}$ with $D_{TV

Theorems & Definitions (13)

Definition 2.1: Selection samplers
Definition 4.1: Restricted Boltzmann Machine (RBM)
Definition 4.2: Efficiently evaluatable representation
Theorem 4.1
proof
Theorem 4.2
Lemma A.1
proof
Proposition A.2
proof
...and 3 more

Some Notes on the Sample Complexity of Approximate Channel Simulation

TL;DR

Abstract

Some Notes on the Sample Complexity of Approximate Channel Simulation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (13)