How Many Matrices Should I Prepare To Polarize Channels Optimally Fast?
Hsin-Po Wang, Venkatesan Guruswami
TL;DR
This work analyzes how many kernels are needed to polarize channels optimally fast using dynamic $\\ell\\times\\ell$ kernels, quantifying a trade-off between kernel count and the scaling exponent $\\mu$. By introducing bundles of similar BMS channels and a hitting-set framework, it proves that $m = O(\\ell^{3/\\mu-1})$ kernels suffice to achieve a target $\\mu$, with $m \\approx O(\\sqrt{\\ell})$ when $\\mu \\approx 2$ and $m \\approx 1$ when $\\mu \\approx 3$. The analysis leverages degradation/upgradation concepts, probabilistic bounds on random kernels, and a geometric view of the BMS channel space to connect kernel design, channel similarity, and polarization performance. These results clarify how to balance hardware complexity and polarization speed, recovering known single-kernel advantages in special cases (e.g., BEC) while revealing fundamental limits for general BMS channels.
Abstract
Polar codes that approach capacity at a near-optimal speed, namely with scaling exponents close to $2$, have been shown possible for $q$-ary erasure channels (Pfister and Urbanke), the BEC (Fazeli, Hassani, Mondelli, and Vardy), all BMS channels (Guruswami, Riazanov, and Ye), and all DMCs (Wang and Duursma). There is, nevertheless, a subtlety separating the last two papers from the first two, namely the usage of multiple dynamic kernels in the polarization process, which leads to increased complexity and fewer opportunities to hardware-accelerate. This paper clarifies this subtlety, providing a trade-off between the number of kernels in the construction and the scaling exponent. We show that the number of kernels can be bounded by $O(\ell^{3/μ-1})$ where $μ$ is the targeted scaling exponent and $\ell$ is the kernel size. In particular, if one settles for scaling exponent approaching $3$, a single kernel suffices, and to approach the optimal scaling exponent of $2$, about $O(\sqrt{\ell})$ kernels suffice.
