Unified Error Correction Code Transformer with Low Complexity

Yongli Yan; Jieao Zhu; Tianyue Zheng; Zhuo Xu; Chao Jiang; Linglong Dai

Unified Error Correction Code Transformer with Low Complexity

Yongli Yan, Jieao Zhu, Tianyue Zheng, Zhuo Xu, Chao Jiang, Linglong Dai

TL;DR

This work tackles the hardware cost and scalability limits of decoding multiple linear block codes in 6G by introducing a Unified Error Correction Code Transformer (UECCT). It replaces the quadratic self-attention of prior AI decoders with a low-rank, shared Unified Attention Module that operates in $O(N)$ time, aided by a sparse mask derived from the parity-check matrix to enforce code constraints. The approach uses a standardized unit to align code lengths and a sparsity-aware mechanism to enable cross-code generalization, achieving up to $86\%$ reduction in attention complexity and improved decoding accuracy across LDPC, Polar, BCH, and other codes relative to ECCT and FECCT. Experiments show that UECCT delivers competitive or superior bit- and block-error performance while significantly reducing training and inference time, indicating strong potential for scalable, low-cost decoders in next-generation wireless systems.

Abstract

Channel coding is vital for reliable sixth-generation (6G) data transmission, employing diverse error correction codes for various application scenarios. Traditional decoders require dedicated hardware for each code, leading to high hardware costs. Recently, artificial intelligence (AI)-driven approaches, such as the error correction code Transformer (ECCT) and its enhanced version, the foundation error correction code Transformer (FECCT), have been proposed to reduce the hardware cost by leveraging the Transformer to decode multiple codes. However, their excessively high computational complexity of $\mathcal{O}(N^2)$ due to the self-attention mechanism in the Transformer limits scalability, where $N$ represents the sequence length. To reduce computational complexity, we propose a unified Transformer-based decoder that handles multiple linear block codes within a single framework. Specifically, a standardized unit is employed to align code length and code rate across different code types, while a redesigned low-rank unified attention module, with computational complexity of $\mathcal{O}(N)$, is shared across various heads in the Transformer. Additionally, a sparse mask, derived from the parity-check matrix's sparsity, is introduced to enhance the decoder's ability to capture inherent constraints between information and parity-check bits, improving decoding accuracy and further reducing computational complexity by $86\%$. Extensive experimental results demonstrate that the proposed unified Transformer-based decoder outperforms existing methods and provides a high-performance, low-complexity solution for next-generation wireless communication systems.

Unified Error Correction Code Transformer with Low Complexity

TL;DR

time, aided by a sparse mask derived from the parity-check matrix to enforce code constraints. The approach uses a standardized unit to align code lengths and a sparsity-aware mechanism to enable cross-code generalization, achieving up to

reduction in attention complexity and improved decoding accuracy across LDPC, Polar, BCH, and other codes relative to ECCT and FECCT. Experiments show that UECCT delivers competitive or superior bit- and block-error performance while significantly reducing training and inference time, indicating strong potential for scalable, low-cost decoders in next-generation wireless systems.

Abstract

due to the self-attention mechanism in the Transformer limits scalability, where

represents the sequence length. To reduce computational complexity, we propose a unified Transformer-based decoder that handles multiple linear block codes within a single framework. Specifically, a standardized unit is employed to align code length and code rate across different code types, while a redesigned low-rank unified attention module, with computational complexity of

, is shared across various heads in the Transformer. Additionally, a sparse mask, derived from the parity-check matrix's sparsity, is introduced to enhance the decoder's ability to capture inherent constraints between information and parity-check bits, improving decoding accuracy and further reducing computational complexity by

. Extensive experimental results demonstrate that the proposed unified Transformer-based decoder outperforms existing methods and provides a high-performance, low-complexity solution for next-generation wireless communication systems.

Paper Structure (17 sections, 18 equations, 16 figures, 2 tables)

This paper contains 17 sections, 18 equations, 16 figures, 2 tables.

Introduction
Prior Works
Contributions
Background
Vanilla Transformers
Forward Error Correction Codes
Mask Generation in ECCT and FECCT
Proposed Unified Error Correction Code Transformer
Shared Unified Attention
Sparse Masked Unified Attention
Architecture and Training Methodology
Experiments
Analysis
Impact of Encoder Layers and Embedding Dimensions
Impact of Sparse Masked Attention
...and 2 more sections

Figures (16)

Figure 1: Single-layer vanilla Transformer encoder architecture.
Figure 2: Architecture of multi-head self-attention.
Figure 3: Architecture of the communication system.
Figure 4: Decoder framework with pre- and post-processing.
Figure 5: Rank distribution of the $\textbf{Q} \cdot \textbf{K}^T$ matrix in the ECCT model across 6 encoder layers for various code types.
...and 11 more figures

Unified Error Correction Code Transformer with Low Complexity

TL;DR

Abstract

Unified Error Correction Code Transformer with Low Complexity

Authors

TL;DR

Abstract

Table of Contents

Figures (16)