Fully Parallelized BP Decoding for Quantum LDPC Codes Can Outperform BP-OSD
Ming Wang, Ang Li, Frank Mueller
TL;DR
This work tackles the challenge of real-time decoding for quantum LDPC codes by introducing BP-SF, a fully parallelizable belief-propagation decoder with speculative syndrome flips guided by oscillation statistics. By avoiding costly Gaussian elimination and exploiting parallelism, BP-SF achieves logical error rates comparable to BP-OSD while delivering substantial latency reductions across code families and noise models. The approach is supported by thorough analyses of BP dynamics, a novel oscillation-guided candidate selection, and extensive simulations on code-capacity and circuit-level noise, including bivariate bicycle codes and SHYPS variants. The results indicate strong potential for hardware-friendly, real-time quantum error correction in scalable architectures, with the main limitations arising in circuit-level regimes that require more trial decoding, and future work focusing on candidate selection, sampling strategies, and inner BP improvements for further gains.
Abstract
This work presents a hardware-efficient and fully parallelizable decoder for quantum LDPC codes that leverages belief propagation (BP) with a speculative post-processing strategy inspired by classical Chase decoding algorithm. By monitoring bit-level oscillation patterns during BP, our method identifies unreliable bits and generates multiple candidate vectors to selectively flip syndromes. Each modified syndrome is then decoded independently using short-depth BP, a process we refer to as BP-SF (syndrome flip). This design eliminates the need for costly Gaussian elimination used in the current BP-OSD approaches. Our implementation achieves logical error rates comparable to or better than BP-OSD while offering significantly lower latency due to its high degree of parallelism for a variety of bivariate bicycle codes. Evaluation on the [[144,12,12]] bivariate bicycle code shows that the proposed decoder reduces average latency to approximately $70\%$ of BP-OSD. When post-processing is parallelized the average latency is reduced by $55\%$ compared to the single process implementation, with the maximum latency reaching as low as $18\%$. These advantages make it particularly well-suited for real-time and resource-constrained quantum error correction systems.
