PAC Learnability for Reliable Communication over Discrete Memoryless Channels
Jiakun Liu, Wenyi Zhang, H. Vincent Poor
TL;DR
The paper addresses reliable communication over unknown discrete memoryless channels by framing decoding-metric selection and code-rate design as a PAC-learning problem. It shows that naive plug-in methods fail with finite data, and introduces the alpha-virtual-sample algorithm to produce decoding metrics with provably high LM-rate performance, enabling rates approaching the channel mutual information $I(p,w)$. Building on this, the VSEE scheme combines virtual-sample decoding with entropy-based mutual information estimation to yield learnable code rates $R$ satisfying $I(p,w)-\epsilon \le R\le I_{\mathrm{LM}}(p,w,K)$ with high probability, thereby establishing PAC learnability of DMCs. Empirical evaluations demonstrate practical training sizes sufficing for reliable metric selection and rate estimation, supporting the viability of data-driven decoding in channel uncertainty and guiding future work on input-distribution learning and tighter finite-sample bounds.
Abstract
In practical communication systems, knowledge of channel models is often absent, and consequently, transceivers need be designed based on empirical data. In this work, we study data-driven approaches to reliably choosing decoding metrics and code rates that facilitate reliable communication over unknown discrete memoryless channels (DMCs). Our analysis is inspired by the PAC (probably approximately correct) learning theory and does not rely on any assumptions on the statistical characteristics of DMCs. We show that a naive plug-in algorithm for choosing decoding metrics is likely to fail for finite training sets. We propose an alternative algorithm called the virtual sample algorithm and establish a non-asymptotic lower bound on its performance. The virtual sample algorithm is then used as a building block for constructing a learning algorithm that chooses a decoding metric and a code rate using which a transmitter and a receiver can reliably communicate at a rate arbitrarily close to the channel mutual information. Therefore, we conclude that DMCs are PAC learnable.
