Source Coding with Free Bits and the Multi-Way Number Partitioning Problem
Niloufar Ahmadypour, Amin Gohari
TL;DR
This paper addresses a two-channel source coding problem with a free and a costly channel, reducing the encoding task to partitioning the source alphabet into k subsets and encoding within subsets. It establishes that a variant of Huffman coding, the Early Stopping Huffman algorithm, is optimal for the compression objective, which minimizes the conditional entropy H(X|A). The authors then connect this problem to the classical number partitioning problem, introducing a compression objective and an alpha-divergence objective, along with bounds and a principle of optimality for entropy-based and related objectives. The work advances a bridge between information theory and combinatorial partitioning, yielding an efficient O(n log n) method for exact solutions in the compression setting and provable approximation bounds for the broader alpha-divergence family, with potential implications for resource allocation and fair division in distributed systems.
Abstract
We introduce a new variant of variable-length source coding for sending a source over two parallel channels, one of which is costly and the other free. We give a complete solution to this problem. Next, we relate the problem to the number partitioning problem, which is the task of dividing a given list of numbers into a pre-specified number of subsets such that the sum of the numbers in each subset is as nearly equal as possible. We introduce two new objective functions for this problem and show that an adapted version of the Huffman coding algorithm (with a runtime of $\mathcal{O}(n \log n)$ for input size $n$) produces the optimal solution for one objective function, and a nearly optimal solution for the other objective function.
