Table of Contents
Fetching ...

Optimal redundancy of function-correcting codes

Gennian Ge, Zixiang Xu, Xiande Zhang, Yijun Zhang

TL;DR

This work advances the theory of function-correcting codes (FCCs) by precisely characterizing the redundancy required to protect key function values of binary messages under errors. It develops the distance-demand/irregular-distance code framework to relate FCC redundancy to combinatorial packing via the distance-demand matrix, and then derives tight results for two principal function classes: the Hamming weight and the Hamming weight distribution. For Hamming weight, the authors fuse a novel Gray-code–based construction with refined counting arguments to achieve near-optimal redundancy, lowering the bound to $r_{ ext{wt}}(k,t) \ge 4t - \frac{4}{3}\sqrt{6t+2} + 2$ and tightening the upper bound to $r_{ ext{wt}}(k,t) \le 4t - \log t$. For the weight distribution, they establish distinct regimes: $r_{\Delta_T}(k,t)=2t$ when $T \ge t+1$ and, when $T = o(t)$, a near-optimal $4t - o(t)$ bound, with detailed constructions and bounds depending on $T$ and the quotient $\frac{2t+1}{T}$. The paper further provides a general construction pathway via Gray codes and specific linear codes (e.g., Simplex and Belov-type codes) to realize explicit FCCs with improved redundancy bounds. Overall, the results reveal two different redundancy behaviors under different function definitions and offer practically relevant, explicit code constructions that substantially narrow the gap toward optimal redundancy. The findings have potential implications for storage and communication systems where only particular function values need protection, and they point to extensions to symbol-pair FCCs and other function families.

Abstract

Function-correcting codes, introduced by Lenz, Bitar, Wachter-Zeh, and Yaakobi, protect specific function values of a message rather than the entire message. A central challenge is determining the optimal redundancy -- the minimum additional information required to recover function values amid errors. This redundancy depends on both the number of correctable errors $t$ and the structure of message vectors yielding identical function values. While prior works established bounds, key questions remain, such as the optimal redundancy for functions like Hamming weight and Hamming weight distribution, along with efficient code constructions. In this paper, we make the following contributions: (1) For the Hamming weight function, we improve the lower bound on optimal redundancy from $\frac{10(t-1)}{3}$ to $4t - \frac{4}{3}\sqrt{6t+2} + 2$. On the other hand, we provide a systematical approach to constructing explicit FCCs via a novel connection with Gray codes, which also improve the previous upper bound from $\frac{4t-2}{1 - 2\sqrt{\log{2t}/(2t)}}$ to $4t - \log{t}$. Consequently, we almost determine the optimal redundancy for Hamming weight function. (2) The Hamming weight distribution function is defined by the value of Hamming weight divided by a given positive integer $T$. Previous work established that the optimal redundancy is $2t$ when $T > 2t$, while the case $T \le 2t$ remained unclear. We show that the optimal redundancy remains $2t$ when $T \ge t+1$. However, in the surprising regime where $T = o(t)$, we achieve near-optimal redundancy of $4t - o(t)$. Our results reveal a significant distinction in behavior of redundancy for distinct choices of $T$.

Optimal redundancy of function-correcting codes

TL;DR

This work advances the theory of function-correcting codes (FCCs) by precisely characterizing the redundancy required to protect key function values of binary messages under errors. It develops the distance-demand/irregular-distance code framework to relate FCC redundancy to combinatorial packing via the distance-demand matrix, and then derives tight results for two principal function classes: the Hamming weight and the Hamming weight distribution. For Hamming weight, the authors fuse a novel Gray-code–based construction with refined counting arguments to achieve near-optimal redundancy, lowering the bound to and tightening the upper bound to . For the weight distribution, they establish distinct regimes: when and, when , a near-optimal bound, with detailed constructions and bounds depending on and the quotient . The paper further provides a general construction pathway via Gray codes and specific linear codes (e.g., Simplex and Belov-type codes) to realize explicit FCCs with improved redundancy bounds. Overall, the results reveal two different redundancy behaviors under different function definitions and offer practically relevant, explicit code constructions that substantially narrow the gap toward optimal redundancy. The findings have potential implications for storage and communication systems where only particular function values need protection, and they point to extensions to symbol-pair FCCs and other function families.

Abstract

Function-correcting codes, introduced by Lenz, Bitar, Wachter-Zeh, and Yaakobi, protect specific function values of a message rather than the entire message. A central challenge is determining the optimal redundancy -- the minimum additional information required to recover function values amid errors. This redundancy depends on both the number of correctable errors and the structure of message vectors yielding identical function values. While prior works established bounds, key questions remain, such as the optimal redundancy for functions like Hamming weight and Hamming weight distribution, along with efficient code constructions. In this paper, we make the following contributions: (1) For the Hamming weight function, we improve the lower bound on optimal redundancy from to . On the other hand, we provide a systematical approach to constructing explicit FCCs via a novel connection with Gray codes, which also improve the previous upper bound from to . Consequently, we almost determine the optimal redundancy for Hamming weight function. (2) The Hamming weight distribution function is defined by the value of Hamming weight divided by a given positive integer . Previous work established that the optimal redundancy is when , while the case remained unclear. We show that the optimal redundancy remains when . However, in the surprising regime where , we achieve near-optimal redundancy of . Our results reveal a significant distinction in behavior of redundancy for distinct choices of .

Paper Structure

This paper contains 14 sections, 17 theorems, 48 equations, 1 figure, 1 table.

Key Result

Theorem 1.5

For any function $f: \mathbb{Z}_2^k \to \mathrm{Im}(f)$,

Figures (1)

  • Figure 1.1: Overview of the systematic FCC framework: In this setup, Alice wishes to transmit a message $\boldsymbol{u}$ ($\boldsymbol{u}$ can be viewed as a binary vector) to Bob, where a specific attribute $f(\boldsymbol{u})$ is of primary interest. To ensure Bob can accurately recover this attribute despite potential transmission errors, Alice encodes $\boldsymbol{u}$ into a structured codeword $\boldsymbol{c} = (\boldsymbol{u}, \boldsymbol{p})$, where $\boldsymbol{p}$ provides redundancy. Upon receiving a possibly corrupted version $\boldsymbol{y}$ of $\boldsymbol{c}$, Bob leverages his knowledge of $f$ to reliably infer $f(\boldsymbol{u})$, even without full error correction.

Theorems & Definitions (33)

  • Definition 1.1
  • Definition 1.3
  • Definition 1.4
  • Theorem 1.5: lenz2023function
  • Definition 1.6
  • Corollary 1.7: lenz2023function
  • Lemma 1.8: lenz2023function
  • Lemma 1.9: lenz2023function
  • Lemma 1.10: lenz2023functionxia2024function
  • Lemma 2.1: lenz2023function
  • ...and 23 more