PolarZero: A Reinforcement Learning Approach for Low-Complexity Polarization Kernel Design
Yi-Ting Hong, Stefano Rini, Luca Barletta
TL;DR
This work tackles designing large-polarization kernels with low decoding complexity by framing kernel search as a reinforcement learning problem. It introduces PolarZero, which leverages a Gumbel AlphaZero framework to construct kernel matrices that satisfy a target partial distance profile while minimizing RMLD complexity, including innovations like randomized initialization and multi-size training. Empirically, PolarZero discovers a 16×16 kernel with about 17% lower decoding cost than handcrafted designs at an exponent of approximately $0.5183$, outperforming Arıkan’s kernel ($E=0.5$). The results demonstrate a practical, data-driven pathway to tailor polar codes for hardware-friendly decoding, latency, and memory constraints, with broader implications for automated code design under implementation constraints.
Abstract
Polar codes with large kernels can achieve improved error exponents but are challenging to design with low decoding complexity. This work investigates kernel construction under recursive maximum likelihood decoding (RMLD) using a reinforcement learning framework based on the Gumbel AlphaZero algorithm. The proposed method efficiently explores the design space and identifies large-size kernels that satisfy a given error exponent while minimizing decoding complexity. For a size-16 kernel, it achieves 17% lower decoding complexity than handcrafted designs while reaching an error exponent of 0.5183 compared to 0.5 for Arikan's kernel, demonstrating the effectiveness of the learning-based approach for practical polar code construction.
