Quantum Maximum Entropy Inference and Hamiltonian Learning
Minbo Gao, Zhengfeng Ji, Fuchao Wei
TL;DR
This work extends classical maximum entropy inference to quantum Hamiltonian learning by formulating Quantum Iterative Scaling (QIS) and gradient-descent (GD) on the dual problem. It provides a precise Jacobian for QIS, $J_{QIS} = \mathds{1} - P^{-1} L$, and contrasts it with $J_{GD} = \mathds{1} - \eta L$, enabling Ostrowski-based convergence analysis; by proving $L \preceq P$ and leveraging strong convexity, the authors establish a polynomial convergence rate $1 - \Omega(1/m^2)$ for local Hamiltonians. The paper also introduces two quasi-Newton accelerations—Anderson mixing for QIS and L-BFGS for GD—achieving orders-of-magnitude speedups in Hamiltonian inference experiments. These advances support efficient quantum Hamiltonian learning and highlight practical routes for accelerating fixed-point quantum optimization routines, with implications for quantum machine learning and state reconstruction.
Abstract
Maximum entropy inference and learning of graphical models are pivotal tasks in learning theory and optimization. This work extends algorithms for these problems, including generalized iterative scaling (GIS) and gradient descent (GD), to the quantum realm. While the generalization, known as quantum iterative scaling (QIS), is straightforward, the key challenge lies in the non-commutative nature of quantum problem instances, rendering the convergence rate analysis significantly more challenging than the classical case. Our principal technical contribution centers on a rigorous analysis of the convergence rates, involving the establishment of both lower and upper bounds on the spectral radius of the Jacobian matrix for each iteration of these algorithms. Furthermore, we explore quasi-Newton methods to enhance the performance of QIS and GD. Specifically, we propose using Anderson mixing and the L-BFGS method for QIS and GD, respectively. These quasi-Newton techniques exhibit remarkable efficiency gains, resulting in orders of magnitude improvements in performance. As an application, our algorithms provide a viable approach to designing Hamiltonian learning algorithms.
