Quantum Maximum Entropy Inference and Hamiltonian Learning

Minbo Gao; Zhengfeng Ji; Fuchao Wei

Quantum Maximum Entropy Inference and Hamiltonian Learning

Minbo Gao, Zhengfeng Ji, Fuchao Wei

TL;DR

This work extends classical maximum entropy inference to quantum Hamiltonian learning by formulating Quantum Iterative Scaling (QIS) and gradient-descent (GD) on the dual problem. It provides a precise Jacobian for QIS, $J_{QIS} = \mathds{1} - P^{-1} L$, and contrasts it with $J_{GD} = \mathds{1} - \eta L$, enabling Ostrowski-based convergence analysis; by proving $L \preceq P$ and leveraging strong convexity, the authors establish a polynomial convergence rate $1 - \Omega(1/m^2)$ for local Hamiltonians. The paper also introduces two quasi-Newton accelerations—Anderson mixing for QIS and L-BFGS for GD—achieving orders-of-magnitude speedups in Hamiltonian inference experiments. These advances support efficient quantum Hamiltonian learning and highlight practical routes for accelerating fixed-point quantum optimization routines, with implications for quantum machine learning and state reconstruction.

Abstract

Maximum entropy inference and learning of graphical models are pivotal tasks in learning theory and optimization. This work extends algorithms for these problems, including generalized iterative scaling (GIS) and gradient descent (GD), to the quantum realm. While the generalization, known as quantum iterative scaling (QIS), is straightforward, the key challenge lies in the non-commutative nature of quantum problem instances, rendering the convergence rate analysis significantly more challenging than the classical case. Our principal technical contribution centers on a rigorous analysis of the convergence rates, involving the establishment of both lower and upper bounds on the spectral radius of the Jacobian matrix for each iteration of these algorithms. Furthermore, we explore quasi-Newton methods to enhance the performance of QIS and GD. Specifically, we propose using Anderson mixing and the L-BFGS method for QIS and GD, respectively. These quasi-Newton techniques exhibit remarkable efficiency gains, resulting in orders of magnitude improvements in performance. As an application, our algorithms provide a viable approach to designing Hamiltonian learning algorithms.

Quantum Maximum Entropy Inference and Hamiltonian Learning

TL;DR

, and contrasts it with

, enabling Ostrowski-based convergence analysis; by proving

and leveraging strong convexity, the authors establish a polynomial convergence rate

for local Hamiltonians. The paper also introduces two quasi-Newton accelerations—Anderson mixing for QIS and L-BFGS for GD—achieving orders-of-magnitude speedups in Hamiltonian inference experiments. These advances support efficient quantum Hamiltonian learning and highlight practical routes for accelerating fixed-point quantum optimization routines, with implications for quantum machine learning and state reconstruction.

Abstract

Paper Structure (22 sections, 11 theorems, 44 equations, 7 figures, 1 table, 2 algorithms)

This paper contains 22 sections, 11 theorems, 44 equations, 7 figures, 1 table, 2 algorithms.

Introduction
Explicit Jacobian Formula for Iterations.
Upper and Lower Bounds for the Jacobian.
Accelerations.
Preliminary
Quantum Iterative Scaling
Convergence Rate
Acceleration by Quasi-Newton Methods
Experiments
Discussions
Prime and Dual Divergence Minimization Problems
Proofs for Convergence Rate
Bounds on the Hessian Matrix
Discussions on Quasi-Newton Methods
Experimental Details
...and 7 more sections

Key Result

Theorem 4.1

The Jacobian of the iterative update map $\lambda^{(t)} \mapsto \lambda^{(t+1)}$ of alg:qis for $\sigma_{0} = \mathds{1}/d$ is given by $\mathds{1} - P^{-1} L$ for $P$ and $L$ defined above with $\lambda = \lambda^{(t)}$.

Figures (7)

Figure 1: The maximum entropy problem and its dual problem. Here, $S(\rho) = -\mathop{\mathrm{tr}}\nolimits(\rho \ln \rho)$ is the von Neumann entropy of $\rho$.
Figure 2: Comparison of QIS and GD algorithms. The loss in measured by the error in the objective function of the maximum entropy problem.
Figure 3: Comparisons of the AM-QIS and L-BFGS-GD algorithms. The \ref{['fig:amqis-lbfgs-a']} on the left uses the Barzilai-Borwein method to choose the step size and \ref{['fig:amqis-lbfgs-b']} on the right uses fixed step size. The dotted (red) and dashed (green) lines represent the performance of the algorithms when the input Hamiltonian terms are complete satisfying $\sum_{j} F_{j} = \mathds{1}$.
Figure 4: Two optimization problems of the Kullback-Leibler divergence that are dual to each other.
Figure 5: Results of the Ising Hamiltonian for $7$ qubits.
...and 2 more figures

Theorems & Definitions (18)

Theorem 4.1
Theorem 4.2
Theorem 4.3
Theorem 4.4
proof : Proof of \ref{['thm:qis-convergence']}
proof : Proof of \ref{['thm:jacobian-qis']}
proof : Proof of \ref{['thm:jacobian-gd']}
Theorem B.1: Ostrowski's theorem Ost66
Lemma C.1: Quantum Belief Propagation Has07
Lemma C.2: Bochner's Theorem
...and 8 more

Quantum Maximum Entropy Inference and Hamiltonian Learning

TL;DR

Abstract

Quantum Maximum Entropy Inference and Hamiltonian Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (18)