End-to-End Learning of Geometrical Shaping Maximizing Generalized Mutual Information
Kadir Gümüs, Alex Alvarado, Bin Chen, Christian Häger, Erik Agrell
TL;DR
This paper tackles the nonconvexity of GMI-based end-to-end learning for geometric shaping in optical communications. It shows that standard AE optimization is prone to poor local optima and that domain-specific initialization, especially Gray-labeled constellations, is essential. The authors propose a fast coordinate-optimization approach—eliminating hidden layers so constellation coordinates are directly optimized—initialized with Gray-labeled APSK and evaluated with Gauss-Hermite quadratures. The method delivers state-of-the-art geometrical constellations up to M=1024 in 2D and M=64 in 4D, with SNR gains up to about 1 dB and reach gains up to 26% over QAM, while significantly reducing computation time, highlighting practical impact for high-rate, fiber-optic systems.
Abstract
GMI-based end-to-end learning is shown to be highly nonconvex. We apply gradient descent initialized with Gray-labeled APSK constellations directly to the constellation coordinates. State-of-the-art constellations in 2D and 4D are found providing reach increases up to 26\% w.r.t. to QAM.
