Trace Reconstruction of First-Order Reed-Muller Codewords Using Run Statistics
Shiv Pratap Singh Rathore, Navin Kashyap
TL;DR
We address exact reconstruction of a binary sequence from multiple traces through an i.i.d. deletion channel, focusing on codewords of the binary first-order Reed–Muller code RM$(m,1)$. The authors derive an explicit formula for the expected total number of runs $\mathbb{E}[R_x]$ in a trace as a function of the codeword $x$ and the deletion rate $q$, and specialize to $q=\tfrac{1}{2}$ to enable reconstruction. They prove that, w.h.p., a codeword from RM$(m,1)$ can be recovered from $\tilde{O}(n^2)$ traces, using a two-phase strategy: infer the first bit from $O(\log n)$ traces and then separate codewords with the same first bit by comparing the empirical mean run count against a guaranteed gap $\Delta\ge 0.028$. This work demonstrates a purely statistical use of run counts for trace reconstruction and clarifies the limitations for higher-order RM codes, outlining avenues for refining the statistic or combining it with other features.
Abstract
In this paper, we derive an expression for the expected number of runs in a trace of a binary sequence $x \in \{0,1\}^n$ obtained by passing $x$ through a deletion channel that independently deletes each bit with probability $q$. We use this expression to show that if $x$ is a codeword of a first-order Reed-Muller code, and the deletion probability $q$ is 1/2, then $x$ can be reconstructed, with high probability, from $\tilde{O}(n^2)$ many of its traces.
