Table of Contents
Fetching ...

Learning to Decode Quantum LDPC Codes Via Belief Propagation

Mohsen Moradi, Vahid Nourozi, Salman Habib, David G. M. Mitchell

TL;DR

Simulation results demonstrate the superiority of the proposed RL-based QLDPC decoders in terms of performance and convergence speed when compared to flooding and random sequential schedules, while achieving performance competitive with state-of-the-art BP-based decoders at comparable complexity.

Abstract

Belief-propagation (BP) decoding for quantum low-density parity-check (QLDPC) codes is appealing due to its low complexity, yet it often exhibits convergence issues due to quantum degeneracy and short cycles that exist in the Tanner graph. To overcome this challenge, this paper proposes a reinforcement-learning (RL) approach that learns (offline) how to decode QLDPC codes based on sequential decoding trajectories. The decoding is formulated as a Markov decision process with a local, syndrome-driven state representation of the underlying RL agent. To enable fast inference, critical for practical implementation, we incrementally update our RL-based QLDPC decoder using second-order neighborhoods that avoid global rescans. Simulation results on representative QLDPC codes demonstrate the superiority of the proposed RL-based QLDPC decoders in terms of performance and convergence speed when compared to flooding and random sequential schedules, while achieving performance competitive with state-of-the-art BP-based decoders at comparable complexity.

Learning to Decode Quantum LDPC Codes Via Belief Propagation

TL;DR

Simulation results demonstrate the superiority of the proposed RL-based QLDPC decoders in terms of performance and convergence speed when compared to flooding and random sequential schedules, while achieving performance competitive with state-of-the-art BP-based decoders at comparable complexity.

Abstract

Belief-propagation (BP) decoding for quantum low-density parity-check (QLDPC) codes is appealing due to its low complexity, yet it often exhibits convergence issues due to quantum degeneracy and short cycles that exist in the Tanner graph. To overcome this challenge, this paper proposes a reinforcement-learning (RL) approach that learns (offline) how to decode QLDPC codes based on sequential decoding trajectories. The decoding is formulated as a Markov decision process with a local, syndrome-driven state representation of the underlying RL agent. To enable fast inference, critical for practical implementation, we incrementally update our RL-based QLDPC decoder using second-order neighborhoods that avoid global rescans. Simulation results on representative QLDPC codes demonstrate the superiority of the proposed RL-based QLDPC decoders in terms of performance and convergence speed when compared to flooding and random sequential schedules, while achieving performance competitive with state-of-the-art BP-based decoders at comparable complexity.
Paper Structure (28 sections, 1 theorem, 87 equations, 7 figures, 4 tables, 1 algorithm)

This paper contains 28 sections, 1 theorem, 87 equations, 7 figures, 4 tables, 1 algorithm.

Key Result

Lemma 1

For a measured syndrome $\bm{s}$, and the current hard estimate $\hat{\bm{e}}$ with the residual mismatch vector $\bm{\delta} = \bm{s} \oplus (H_1 \hat{\bm{e}}),$ if we flip the $i$-th bit of the hard estimate $\hat{\bm{e}}$ then the updated residual mismatch bits flip exactly on the neighboring che

Figures (7)

  • Figure 1: FER for the $[[882, 24, 18\leq d \leq 24]]$ code B1 with flooding BP and SVNS-BP vs. our proposed RL-SVNS-BP decoding algorithm.
  • Figure 2: FER for the $[[882,48,16]]$ code B2 over the depolarizing channel.
  • Figure 3: FER for the $[[180,10,15 \leq d \leq 18]]$ code A5 over the depolarizing channel.
  • Figure 4: FER for the $[[144,12,12]]$ BB code over the depolarizing channel.
  • Figure 5: FER for the $[[288,12,18]]$ BB code over the depolarizing channel.
  • ...and 2 more figures

Theorems & Definitions (4)

  • Lemma 1
  • proof
  • Example 1
  • Example 2