Overlapped Arithmetic Codes
Yong Fang
TL;DR
Overlapped Arithmetic Codes extend classical arithmetic coding by mapping source symbols to partially overlapping sub-intervals, enabling distributed and joint codes through coset-based interval partitions. The paper develops a rigorous CCS framework to analyze and quantify how source-space partitioning into unequal cosets affects encoding/decoding, rate loss, and decoding complexity, and it links this with an asymptotic spectrum f(u) and a suite of practical decoding strategies. It introduces backward-recursive CCS computations, intrinsic/extrinsic path metrics for decoding, and coexisting-interval analysis to derive error rates under partial knowledge of the block. The work further connects CCS to the Hamming Distance Spectrum, proposes soft/hard/fast approximations to HDS, and presents experimental validation, illustrating significant gains in low-complexity decoding scenarios and establishing a rich mathematical foundation for overlapped arithmetic codes in distributed and joint coding settings.
Abstract
Arithmetic codes are usually deemed as the most important means to implement lossless source coding, whose principle is mapping every source symbol to a sub-interval in [0, 1). For every source symbol, the length of its mapping sub-interval is exactly equal to its probability. With this symbol-interval mapping rule, the interval [0,1) will be fully covered and there is neither overlapped sub-interval (corresponds to more than one source symbol) nor forbidden sub-interval (does not correspond to any source symbol). It is well-known that there is a duality between source coding and channel coding, so every good source code may also be a good channel code meanwhile, and vice versa. Inspired by this duality, arithmetic codes can be easily generalized to address many coding problems beyond source coding by redefining the source-interval mapping rule. If every source symbol is mapped to an enlarged sub-interval, the mapping sub-intervals of different source symbols will be partially overlapped and we obtain overlapped arithmetic codes, which can realize distributed source coding. On the contrary, if every source symbol is mapped to a narrowed sub-interval, there will be one or more forbidden sub-intervals in [0, 1) that do not correspond to any source symbol and we obtain forbidden arithmetic codes, which can implement joint source-channel coding. Furthermore, by allowing the coexistence of overlapped sub-intervals and forbidden sub-intervals, we will obtain hybrid arithmetic codes, which can cope with distributed joint source-channel coding.
