Deep Learning Models for Physical Layer Communications
Nunzio A. Letizia
TL;DR
This thesis investigates deep learning approaches to physical layer communications, reframing channel capacity, coding, and decoding as data-driven optimization problems. It advances three interwoven streams: (1) copula-based density estimation and segmented generative networks for flexible statistical modeling of channels and dependencies; (2) end-to-end neural schemes for medium modeling, neural decoding (MIND), and capacity-oriented autoencoders that jointly maximize mutual information and decoding reliability; and (3) variational MI estimation (DIME) with derangement strategies to enable stable, scalable MI and capacity estimation across diverse channels, including PLC scenarios. Key innovations include CODINE for copula density estimation, SGN-C/SGN-D for dependence-aware data synthesis, GAN-based channel synthesis with time–frequency noise modeling, the MIND neural decoder, rate- and capacity-driven autoencoders with MI regularization, and a broad family of f-divergence MI estimators (γ-DIME, KL-/GAN-/HD-DIME) with derangement-based sampling. Collectively, these contributions yield practical, data-driven tools for channel synthesis, neural decoding in unknown channels, and capacity-approaching end-to-end communication systems, with demonstrable gains in robustness and performance across AWGN, non-Gaussian, and Rayleigh-like environments, including PLC contexts. The work outlines a cohesive framework for integrating ML with communications theory to approach, estimate, and exploit channel capacity in challenging, real-world media.
Abstract
The increased availability of data and computing resources has enabled researchers to successfully adopt machine learning (ML) techniques and make significant contributions in several engineering areas. ML and in particular deep learning (DL) algorithms have shown to perform better in tasks where a physical bottom-up description of the phenomenon is lacking and/or is mathematically intractable. Indeed, they take advantage of the observations of natural phenomena to automatically acquire knowledge and learn internal relations. Despite the historical model-based mindset, communications engineering recently started shifting the focus towards top-down data-driven learning models, especially in domains such as channel modeling and physical layer design, where in most of the cases no general optimal strategies are known. In this thesis, we aim at solving some fundamental open challenges in physical layer communications exploiting new DL paradigms. In particular, we mathematically formulate, under ML terms, classic problems such as channel capacity and optimal coding-decoding schemes, for any arbitrary communication medium. We design and develop the architecture, algorithm and code necessary to train the equivalent DL model, and finally, we propose novel solutions to long-standing problems in the field.
