Table of Contents
Fetching ...

Near-Optimal Trace Reconstruction for Mildly Separated Strings

Anders Aamand, Allen Liu, Shyam Narayanan

TL;DR

This work tackles trace reconstruction for constant-deletion channels under the mild separation assumption that zeros between consecutive ones in $x$ number at least $L=\Theta((\log n)^8)$. It introduces a novel left-to-right alignment procedure that preserves a monotone alignment of ones and yields tight coarse gap estimates with high probability, followed by a robust forward-backward scheme to upgrade these estimates to exact gap lengths. The main contribution is a polynomial-time algorithm that uses $N=O(n\log n)$ traces to exactly reconstruct $x$ for $L$-separated strings when $\delta$ is a small constant, with a detailed probabilistic analysis ensuring that the alignment is not ahead and that the probability of falling behind is appropriately bounded. This advances the understanding of tractable instances in trace reconstruction, demonstrates near-optimal trace complexity for this class, and provides techniques (alignment plus forward/backward verification) that may inform future approaches to broader classes of strings and deletion probabilities.

Abstract

In the trace reconstruction problem our goal is to learn an unknown string $x\in \{0,1\}^n$ given independent traces of $x$. A trace is obtained by independently deleting each bit of $x$ with some probability $δ$ and concatenating the remaining bits. It is a major open question whether the trace reconstruction problem can be solved with a polynomial number of traces when the deletion probability $δ$ is constant. The best known upper bound and lower bounds are respectively $\exp(\tilde O(n^{1/5}))$ and $\tilde Ω(n^{3/2})$ both by Chase [Cha21b,Cha21a]. Our main result is that if the string $x$ is mildly separated, meaning that the number of zeros between any two ones in $x$ is at least polylog$n$, and if $δ$ is a sufficiently small constant, then the trace reconstruction problem can be solved with $O(n \log n)$ traces and in polynomial time.

Near-Optimal Trace Reconstruction for Mildly Separated Strings

TL;DR

This work tackles trace reconstruction for constant-deletion channels under the mild separation assumption that zeros between consecutive ones in number at least . It introduces a novel left-to-right alignment procedure that preserves a monotone alignment of ones and yields tight coarse gap estimates with high probability, followed by a robust forward-backward scheme to upgrade these estimates to exact gap lengths. The main contribution is a polynomial-time algorithm that uses traces to exactly reconstruct for -separated strings when is a small constant, with a detailed probabilistic analysis ensuring that the alignment is not ahead and that the probability of falling behind is appropriately bounded. This advances the understanding of tractable instances in trace reconstruction, demonstrates near-optimal trace complexity for this class, and provides techniques (alignment plus forward/backward verification) that may inform future approaches to broader classes of strings and deletion probabilities.

Abstract

In the trace reconstruction problem our goal is to learn an unknown string given independent traces of . A trace is obtained by independently deleting each bit of with some probability and concatenating the remaining bits. It is a major open question whether the trace reconstruction problem can be solved with a polynomial number of traces when the deletion probability is constant. The best known upper bound and lower bounds are respectively and both by Chase [Cha21b,Cha21a]. Our main result is that if the string is mildly separated, meaning that the number of zeros between any two ones in is at least polylog, and if is a sufficiently small constant, then the trace reconstruction problem can be solved with traces and in polynomial time.

Paper Structure

This paper contains 14 sections, 12 theorems, 21 equations, 3 algorithms.

Key Result

Theorem 1.1

There exists an algorithm that solves the trace reconstruction problem with high probability in $n$ on any $L$-separated string $x$, provided that $L\geq C(\log n)^8$ for a universal constant $C$, and that the deletion probability is at most some universal constant $c_0$. The algorithm uses $N=O(n\l

Theorems & Definitions (22)

  • Theorem 1.1
  • Lemma 3.1
  • proof
  • Lemma 3.2
  • Proposition 3.3
  • proof
  • Lemma 3.4
  • proof
  • Lemma 3.5
  • proof
  • ...and 12 more