Table of Contents
Fetching ...

Unbounded Error Correcting Codes

Klim Efremenko, Or Zamir

TL;DR

This work introduces unbounded error-correcting codes to handle indefinite-length transmissions by defining $(R,\varepsilon)$-unbounded codes whose prefixes remain decodable under adversarial noise. It establishes a near-tight rate-distance tradeoff over binary alphabets, proving $R<1-\Omega(\sqrt{\varepsilon})$ and $R>1- O(\sqrt{\varepsilon \log\log(1/\varepsilon)})$; it also shows linear codes perform strictly worse than non-linear ones and that, in the random-noise regime, the optimal rate matches that of standard ECCs. The paper introduces subset codes as a key combinatorial tool, enabling improved constructions that push the rate closer to 1 via layered redundancy and checksums. It also delineates a distinction between adversarial and random-noise models and discusses alphabet-size effects, leaving open the precise gap and explicit constructions as future directions. Overall, unbounded codes reveal fundamental differences from classical ECCs in the unlimited-length setting and open avenues for streaming and real-time communication under hostile error patterns.

Abstract

Traditional error-correcting codes (ECCs) assume a fixed message length, but many scenarios involve ongoing or indefinite transmissions where the message length is not known in advance. For example, when streaming a video, the user should be able to fix a fraction of errors that occurred before any point in time. We introduce unbounded error-correcting codes (unbounded codes), a natural generalization of ECCs that supports arbitrarily long messages without a predetermined length. An unbounded code with rate $R$ and distance $\varepsilon$ ensures that for every sufficiently large $k$, the message prefix of length $Rk$ can be recovered from the code prefix of length $k$ even if an adversary corrupts up to an $\varepsilon$ fraction of the symbols in this code prefix. We study unbounded codes over binary alphabets in the regime of small error fraction $\varepsilon$, establishing nearly tight upper and lower bounds on their optimal rate. Our main results show that: (1) The optimal rate of unbounded codes satisfies $R<1-Ω(\sqrt{\varepsilon})$ and $R>1-O(\sqrt{\varepsilon \log \log(1/\varepsilon)})$. (2) Surprisingly, our construction is inherently non-linear, as we prove that linear unbounded codes achieve a strictly worse rate of $R=1-Θ(\sqrt{\varepsilon \log(1/\varepsilon)})$. (3) In the setting of random noise, unbounded codes achieve the same optimal rate as standard ECCs, $R=1-Θ(\varepsilon \log(1/\varepsilon))$. These results demonstrate fundamental differences between standard and unbounded codes.

Unbounded Error Correcting Codes

TL;DR

This work introduces unbounded error-correcting codes to handle indefinite-length transmissions by defining -unbounded codes whose prefixes remain decodable under adversarial noise. It establishes a near-tight rate-distance tradeoff over binary alphabets, proving and ; it also shows linear codes perform strictly worse than non-linear ones and that, in the random-noise regime, the optimal rate matches that of standard ECCs. The paper introduces subset codes as a key combinatorial tool, enabling improved constructions that push the rate closer to 1 via layered redundancy and checksums. It also delineates a distinction between adversarial and random-noise models and discusses alphabet-size effects, leaving open the precise gap and explicit constructions as future directions. Overall, unbounded codes reveal fundamental differences from classical ECCs in the unlimited-length setting and open avenues for streaming and real-time communication under hostile error patterns.

Abstract

Traditional error-correcting codes (ECCs) assume a fixed message length, but many scenarios involve ongoing or indefinite transmissions where the message length is not known in advance. For example, when streaming a video, the user should be able to fix a fraction of errors that occurred before any point in time. We introduce unbounded error-correcting codes (unbounded codes), a natural generalization of ECCs that supports arbitrarily long messages without a predetermined length. An unbounded code with rate and distance ensures that for every sufficiently large , the message prefix of length can be recovered from the code prefix of length even if an adversary corrupts up to an fraction of the symbols in this code prefix. We study unbounded codes over binary alphabets in the regime of small error fraction , establishing nearly tight upper and lower bounds on their optimal rate. Our main results show that: (1) The optimal rate of unbounded codes satisfies and . (2) Surprisingly, our construction is inherently non-linear, as we prove that linear unbounded codes achieve a strictly worse rate of . (3) In the setting of random noise, unbounded codes achieve the same optimal rate as standard ECCs, . These results demonstrate fundamental differences between standard and unbounded codes.

Paper Structure

This paper contains 20 sections, 23 theorems, 35 equations, 1 figure, 1 table.

Key Result

Theorem 1

For every small enough $\varepsilon>0$ there exists a $(R,\varepsilon)$-unbounded code with $R>1-O\left(\sqrt{\varepsilon\log\log\left(1/\varepsilon\right)}\right)$. Furthermore, for every $(R,\varepsilon)$-unbounded code it holds that $R<1-\Omega\left(\sqrt{\varepsilon}\right)$.

Figures (1)

  • Figure 1: Partition of the new message bits into blocks and sub-blocks, and their participation in the subset codes.

Theorems & Definitions (49)

  • Definition : Unbounded codes
  • Theorem
  • Theorem
  • Theorem
  • Theorem 4.1
  • Lemma 4.2
  • Claim 4.3
  • proof
  • proof : Proof of Lemma \ref{['lem:const_tau']}
  • proof : Proof of Theorem \ref{['thm:const']}
  • ...and 39 more