A Bregman Proximal Perspective on Classical and Quantum Blahut-Arimoto Algorithms
Kerry He, James Saunderson, Hamza Fawzi
TL;DR
This work unifies classical and quantum Blahut–Arimoto algorithms within a mirror-descent framework, showing BA iterations are entropic (and more generally Bregman) proximal updates for appropriately extended objectives. It leverages relative smoothness and relative strong convexity to recover sublinear and linear convergence rates, respectively, and extends the framework to problems with general linear constraints via primal-dual hybrid gradient (PDHG), including backtracking variants. By selecting kernel functions like entropic $-H$ and the negative log-determinant, the authors derive scalable algorithms for energy-constrained channel capacities, classical and quantum rate-distortion functions, and the relative entropy of quantum resources, all with provable convergence guarantees. The framework enables efficient, scalable solutions for large-scale information-theoretic problems and offers a clear path to extending BA-type methods to new constrained or resource-theoretic quantities in classical and quantum settings.
Abstract
The Blahut-Arimoto algorithm is a well-known method to compute classical channel capacities and rate-distortion functions. Recent works have extended this algorithm to compute various quantum analogs of these quantities. In this paper, we show how these Blahut-Arimoto algorithms are special instances of mirror descent, which is a type of Bregman proximal method, and a well-studied generalization of gradient descent for constrained convex optimization. Using recently developed convex analysis tools, we show how analysis based on relative smoothness and strong convexity recovers known sublinear and linear convergence rates for Blahut-Arimoto algorithms. This Bregman proximal viewpoint allows us to derive related algorithms with similar convergence guarantees to solve problems in information theory for which Blahut-Arimoto-type algorithms are not directly applicable. We apply this framework to compute energy-constrained classical and quantum channel capacities, classical and quantum rate-distortion functions, and approximations of the relative entropy of entanglement, all with provable convergence guarantees.
