Table of Contents
Fetching ...

Rigorous Maximum Likelihood Estimation for Quantum States

Kuchibhotla Aditi, Stephen Becker

TL;DR

This work addresses the exponential challenge of quantum state tomography by recasting maximum likelihood estimation through a Burer–Monteiro factorization, writing $\rho = UU^{\dagger}$ to enforce PSD structure and enable low-rank modeling. By analytically eliminating the trace constraint and choosing a priori $\lambda = \sum_i f_i$, the authors obtain a fully unconstrained BM–MLE objective $J_\lambda(U)$ and develop scalable first-order optimization methods (notably L-BFGS) along with a low-memory BM scheme that avoids forming $\rho$ explicitly. They provide a rigorous a posteriori error bound and demonstrate competitive performance on moderate systems, plus a substantial advance in scalability by solving a 20-qubit state tomography problem on a laptop within hours using Pauli and tetrahedral POVMs. Overall, the framework delivers a principled, scalable approach to QST that does not rely on tensor-network structure, enabling more practical and rigorous tomography for large quantum devices and facilitating uncertainty quantification and benchmarking.

Abstract

Existing quantum state tomography methods are limited in scalability due to their high computation and memory demands, making them impractical for recovery of large quantum states. In this work, we address these limitations by reformulating the maximum likelihood estimation (MLE) problem using the Burer-Monteiro factorization, resulting in a non-convex but low-rank parameterization of the density matrix. We derive a fully unconstrained formulation by analytically eliminating the trace-one and positive semidefinite constraints, thereby avoiding the need for projection steps during optimization. Furthermore, we determine the Lagrange multiplier associated with the unit-trace constraint a priori, reducing computational overhead. The resulting formulation is amenable to scalable first-order optimization, and we demonstrate its tractability using limited-memory BFGS (L-BFGS). Importantly, we also propose a low-memory version of the above algorithm to fully recover certain large quantum states with Pauli-based POVM measurements. Our low-memory algorithm avoids explicitly forming any density matrix, and does not require the density matrix to have a matrix product state (MPS) or other tensor structure. For a fixed number of measurements and fixed rank, our algorithm requires just $\mathcal{O}(d \log d)$ complexity per iteration to recover a $d \times d$ density matrix. Additionally, we derive a useful error bound that can be used to give a rigorous termination criterion. We numerically demonstrate that our method is competitive with state-of-the-art algorithms for moderately sized problems, and then demonstrate that our method can solve a 20-qubit problem on a laptop in under 5 hours.

Rigorous Maximum Likelihood Estimation for Quantum States

TL;DR

This work addresses the exponential challenge of quantum state tomography by recasting maximum likelihood estimation through a Burer–Monteiro factorization, writing to enforce PSD structure and enable low-rank modeling. By analytically eliminating the trace constraint and choosing a priori , the authors obtain a fully unconstrained BM–MLE objective and develop scalable first-order optimization methods (notably L-BFGS) along with a low-memory BM scheme that avoids forming explicitly. They provide a rigorous a posteriori error bound and demonstrate competitive performance on moderate systems, plus a substantial advance in scalability by solving a 20-qubit state tomography problem on a laptop within hours using Pauli and tetrahedral POVMs. Overall, the framework delivers a principled, scalable approach to QST that does not rely on tensor-network structure, enabling more practical and rigorous tomography for large quantum devices and facilitating uncertainty quantification and benchmarking.

Abstract

Existing quantum state tomography methods are limited in scalability due to their high computation and memory demands, making them impractical for recovery of large quantum states. In this work, we address these limitations by reformulating the maximum likelihood estimation (MLE) problem using the Burer-Monteiro factorization, resulting in a non-convex but low-rank parameterization of the density matrix. We derive a fully unconstrained formulation by analytically eliminating the trace-one and positive semidefinite constraints, thereby avoiding the need for projection steps during optimization. Furthermore, we determine the Lagrange multiplier associated with the unit-trace constraint a priori, reducing computational overhead. The resulting formulation is amenable to scalable first-order optimization, and we demonstrate its tractability using limited-memory BFGS (L-BFGS). Importantly, we also propose a low-memory version of the above algorithm to fully recover certain large quantum states with Pauli-based POVM measurements. Our low-memory algorithm avoids explicitly forming any density matrix, and does not require the density matrix to have a matrix product state (MPS) or other tensor structure. For a fixed number of measurements and fixed rank, our algorithm requires just complexity per iteration to recover a density matrix. Additionally, we derive a useful error bound that can be used to give a rigorous termination criterion. We numerically demonstrate that our method is competitive with state-of-the-art algorithms for moderately sized problems, and then demonstrate that our method can solve a 20-qubit problem on a laptop in under 5 hours.

Paper Structure

This paper contains 27 sections, 7 theorems, 29 equations, 9 figures, 3 tables, 8 algorithms.

Key Result

Proposition 1

If there is a physical quantum state $\rho_0$ such that $f_i = \mathop{\mathrm{tr}}\limits(A_i \rho_0)$ (i.e., $N\to\infty$) for all $i=1,\ldots,\widetilde{m}$, then $\rho_0$ is a solution to the MLE optimization problem. If the set of POVMs is informationally complete, then furthermore $\rho_0$ is

Figures (9)

  • Figure 1: Diagram of our approach to solving Eq. \ref{['cvx-obj']} by solving Eq. \ref{['eq:Lag-U']}.
  • Figure 2: Runtime performance of the proposed low-memory BM-MLE for large qubit systems. (a) Time taken for calculation of probability values as a function of $M$ with varied number of qubits $n$. (b) Observed runtime complexity of the proposed algorithm as a function of Hilbert space dimension $d$; $\mathcal{O}(d)$, $\mathcal{O}(d\log d)$ and $\mathcal{O}(d^2)$ scalings shown for reference.
  • Figure 3: Solving the toy problem with parameters $a=1.1$, $b=c=10^4$ and $d=1$, starting at $\boldsymbol{u}_0 = (.99,.01)$; the optimal solution is at $(0.500005, 0.499995)$. Left: linear $x$-scale. Right: logarithmic $x$-scale. The two unconstrained solvers solve Eq. \ref{['problem:toy_unconstrained']} and converge quickly, whereas the constrained solvers that solve the (equivalent) problem Eq. \ref{['problem:toy']} converge slowly or not at all.
  • Figure 4: Error performance of different methods for complete Pauli POVM measurements under $10\%$ depolarizing noise on the $W$ state for — (a) $n=4$ and (b) $n=10$. Note that the number of iterations is not equivalent to runtime; the per-iteration runtimes are listed in Table \ref{['tab:timing']}, where our algorithm demonstrates better performance than the others.
  • Figure 5: Comparison of different methods in terms of time taken to reach the two accuracy regimes — (a) Low-accuracy regime and (b) High-accuracy regime, under complete Pauli POVM measurements and $10\%$ depolarizing noise on the $W$ state. "L-BFGS" and "Acc-GD" are our proposed methods, Algo. \ref{['algo:L-BFGS']} and Algo. \ref{['algo:AGD']}, resp. Algorithms "LBSDA" and "MEG" never converge to the required accuracy, so they are not shown.
  • ...and 4 more figures

Theorems & Definitions (13)

  • Proposition 1: Solution of the MLE with infinite measurements
  • proof
  • Remark 1: Error metrics
  • Theorem 1: Burer-Monteiro style result, simplified version of Thm. 4.1 in AravkinBM
  • Theorem 2
  • proof
  • Theorem 3: Unconstrained MLE
  • Theorem 4
  • proof : Proof of Thm. \ref{['thm:equivalence']}
  • proof : Proof of Thm. \ref{['thm:nonconvex-equivalence']}
  • ...and 3 more