Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

Wei-Fu Tseng; Kai-Chun Chen; Zi-Hong Xiao; Yen-Huan Li

Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

Wei-Fu Tseng, Kai-Chun Chen, Zi-Hong Xiao, Yen-Huan Li

TL;DR

The paper addresses online learning of quantum states with the logarithmic loss (LL-OLQS), a challenging non-Lipschitz setting that generalizes online portfolio selection to the quantum domain. It extends the VB-FTRL algorithm to LL-OLQS by introducing VB-convexity and a volumetric barrier framework, enabling a convex optimization at each round and a regret analysis analogous to the classical case. The main result is a regret bound of $O(d^2 \log(d+T))$, with each iteration solving a semidefinite program (SDP) via cutting-plane methods, and a detailed regret decomposition showing how the bias, gain, and miss terms contribute. This work advances practical online quantum state estimation and stochastic optimization for quantum tomography by providing a scalable, regret-optimal method with a principled convexity framework and explicit algorithmic steps.

Abstract

Online learning of quantum states with the logarithmic loss (LL-OLQS) is a quantum generalization of online portfolio selection (OPS), a classic open problem in online learning for over three decades. This problem also emerges in designing stochastic optimization algorithms for maximum-likelihood quantum state tomography. Recently, Jezequel et al. (arXiv:2209.13932) proposed the VB-FTRL algorithm, the first regret-optimal algorithm for OPS with moderate computational complexity. In this paper, we generalize VB-FTRL for LL-OLQS. Let $d$ denote the dimension and $T$ the number of rounds. The generalized algorithm achieves a regret rate of $O ( d^2 \log ( d + T ) )$ for LL-OLQS. Each iteration of the algorithm consists of solving a semidefinite program that can be implemented in polynomial time by, for example, cutting-plane methods. For comparison, the best-known regret rate for LL-OLQS is currently $O ( d^2 \log T )$, achieved by an exponential weight method. However, no explicit implementation is available for the exponential weight method for LL-OLQS. To facilitate the generalization, we introduce the notion of VB-convexity. VB-convexity is a sufficient condition for the volumetric barrier associated with any function to be convex and is of independent interest.

Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

TL;DR

, with each iteration solving a semidefinite program (SDP) via cutting-plane methods, and a detailed regret decomposition showing how the bias, gain, and miss terms contribute. This work advances practical online quantum state estimation and stochastic optimization for quantum tomography by providing a scalable, regret-optimal method with a principled convexity framework and explicit algorithmic steps.

Abstract

denote the dimension and

the number of rounds. The generalized algorithm achieves a regret rate of

for LL-OLQS. Each iteration of the algorithm consists of solving a semidefinite program that can be implemented in polynomial time by, for example, cutting-plane methods. For comparison, the best-known regret rate for LL-OLQS is currently

, achieved by an exponential weight method. However, no explicit implementation is available for the exponential weight method for LL-OLQS. To facilitate the generalization, we introduce the notion of VB-convexity. VB-convexity is a sufficient condition for the volumetric barrier associated with any function to be convex and is of independent interest.

Paper Structure (8 sections, 12 theorems, 23 equations)

This paper contains 8 sections, 12 theorems, 23 equations.

Introduction
Notation
VB-Convex Functions
General Theory
VB-Convexity of $\overline{L}_t$Losses and Log-Determinant Function
Proof of Theorem \ref{['thm_main']}
Algorithm
Regret Analysis

Key Result

Theorem 1.1

There is an algorithm that achieves a regret rate of $O ( d^2 \log ( d + T ) )$ for LL-OLQS. Each iteration of the algorithm consists of solving a semidefinite program.

Theorems & Definitions (18)

Theorem 1.1
Lemma 2.1: Vaidya1996
Definition 2.1: VB-Convexity
Lemma 2.2
Lemma 2.3
Lemma 2.4
Theorem 2.5
proof
Corollary 2.6
Lemma 2.7
...and 8 more

Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

TL;DR

Abstract

Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (18)