Table of Contents
Fetching ...

Computing Low-Entropy Couplings for Large-Support Distributions

Samuel Sokota, Dylan Sam, Christian Schroeder de Witt, Spencer Compton, Jakob Foerster, J. Zico Kolter

TL;DR

This work tackles the minimum-entropy coupling (MEC) problem for marginals with large supports, where exact MEC is intractable. It unifies iterative MEC methods under a partition-based formalism and introduces ARIMEC, the first IMEC variant capable of handling arbitrary discrete distributions, by leveraging an autoregressive prefix-tree partition set. To address brittleness to partition choices, it adds merging, which groups posterior updates to reduce entropy waste and improve robustness. Empirical results in Markov coding games and steganography demonstrate that ARIMEC, especially with merging, achieves higher throughput and more reliable decoding than prior approaches, enabling practical low-entropy couplings for large-scale applications. The work also provides efficient implementation strategies and a codebase for broader adoption in high-throughput, large-support MEC tasks.

Abstract

Minimum-entropy coupling (MEC) -- the process of finding a joint distribution with minimum entropy for given marginals -- has applications in areas such as causality and steganography. However, existing algorithms are either computationally intractable for large-support distributions or limited to specific distribution types and sensitive to hyperparameter choices. This work addresses these limitations by unifying a prior family of iterative MEC (IMEC) approaches into a generalized partition-based formalism. From this framework, we derive a novel IMEC algorithm called ARIMEC, capable of handling arbitrary discrete distributions, and introduce a method to make IMEC robust to suboptimal hyperparameter settings. These innovations facilitate the application of IMEC to high-throughput steganography with language models, among other settings. Our codebase is available at https://github.com/ssokota/mec .

Computing Low-Entropy Couplings for Large-Support Distributions

TL;DR

This work tackles the minimum-entropy coupling (MEC) problem for marginals with large supports, where exact MEC is intractable. It unifies iterative MEC methods under a partition-based formalism and introduces ARIMEC, the first IMEC variant capable of handling arbitrary discrete distributions, by leveraging an autoregressive prefix-tree partition set. To address brittleness to partition choices, it adds merging, which groups posterior updates to reduce entropy waste and improve robustness. Empirical results in Markov coding games and steganography demonstrate that ARIMEC, especially with merging, achieves higher throughput and more reliable decoding than prior approaches, enabling practical low-entropy couplings for large-scale applications. The work also provides efficient implementation strategies and a codebase for broader adoption in high-throughput, large-support MEC tasks.

Abstract

Minimum-entropy coupling (MEC) -- the process of finding a joint distribution with minimum entropy for given marginals -- has applications in areas such as causality and steganography. However, existing algorithms are either computationally intractable for large-support distributions or limited to specific distribution types and sensitive to hyperparameter choices. This work addresses these limitations by unifying a prior family of iterative MEC (IMEC) approaches into a generalized partition-based formalism. From this framework, we derive a novel IMEC algorithm called ARIMEC, capable of handling arbitrary discrete distributions, and introduce a method to make IMEC robust to suboptimal hyperparameter settings. These innovations facilitate the application of IMEC to high-throughput steganography with language models, among other settings. Our codebase is available at https://github.com/ssokota/mec .
Paper Structure (41 sections, 17 theorems, 41 equations, 10 figures, 7 algorithms)

This paper contains 41 sections, 17 theorems, 41 equations, 10 figures, 7 algorithms.

Key Result

Proposition 3.1

IMEC induces a coupling of $\mu$ and $\nu$.

Figures (10)

  • Figure 1: (Left) A set $\mathbb{X}$ of sequences of length 3; (right) a partition $\mathcal{P}_2$ used by FIMEC.
  • Figure 2: (Left) A set $\mathbb{X}$ of sequences of length 2; (middle) the prefix tree for $\mathbb{X}$; (right) the partition induced by the left-most depth one node of the prefix tree.
  • Figure 3: Visualization of two iterations of ARIMEC.
  • Figure 4: Path shown down the prefix tree corresponding to the procedure in \ref{['fig:vis_arimec']}.
  • Figure 5: Results for the Markov coding games CodeCart and CodePong using MaxEntRL policies with different temperatures with 95% bootstrap confidence intervals drawn from 100 games.
  • ...and 5 more figures

Theorems & Definitions (37)

  • Definition 2.1: Coupling
  • Definition 2.2: Joint Entropy
  • Definition 2.3: Minimum-Entropy Coupling
  • Definition 2.5: Partition
  • Definition 2.6: Block Function
  • Proposition 3.1: Coupling
  • Proposition 3.2: Greediness
  • Proposition 3.3: IMEC Runtime
  • Corollary 3.0: TIMEC Runtime
  • Corollary 3.0: FIMEC Runtime
  • ...and 27 more