Learning Linear Block Error Correction Codes
Yoni Choukroun, Lior Wolf
TL;DR
This work tackles the challenge of jointly designing binary linear block codes and their neural decoders for short block lengths by proposing an end-to-end framework that optimizes both the code and a Transformer-based decoder in a differentiable fashion over $GF(2)$. It introduces a differentiable encoding approach with polarization-based binarization and a differentiable parity-check–driven masking for attention, enabling gradient flow through the entire encoding–decoding pipeline. Empirically, the learned codes outperform conventional codes and prior neural decoders, and also deliver improved performance when used with traditional decoders, suggesting a broadly useful approach to code design. The proposed method has practical significance for efficient ECC on edge devices and points toward new families of codes obtained via joint optimization with their decoders.
Abstract
Error correction codes are a crucial part of the physical communication layer, ensuring the reliable transfer of data over noisy channels. The design of optimal linear block codes capable of being efficiently decoded is of major concern, especially for short block lengths. While neural decoders have recently demonstrated their advantage over classical decoding techniques, the neural design of the codes remains a challenge. In this work, we propose for the first time a unified encoder-decoder training of binary linear block codes. To this end, we adapt the coding setting to support efficient and differentiable training of the code for end-to-end optimization over the order two Galois field. We also propose a novel Transformer model in which the self-attention masking is performed in a differentiable fashion for the efficient backpropagation of the code gradient. Our results show that (i) the proposed decoder outperforms existing neural decoding on conventional codes, (ii) the suggested framework generates codes that outperform the {analogous} conventional codes, and (iii) the codes we developed not only excel with our decoder but also show enhanced performance with traditional decoding techniques.
