Quantum Non-Linear Bandit Optimization

Zakaria Shams Siam; Chaowen Guan; Chong Liu

Quantum Non-Linear Bandit Optimization

Zakaria Shams Siam, Chaowen Guan, Chong Liu

TL;DR

This work introduces Q-NLB-UCB, a quantum algorithm for non-linear bandit optimization that uses parametric function approximation to achieve an input-dimension-free regret bound of $R_T = Oig(d_w^2 ext{log}^{3/2}(T) ext{log}(d_w ext{log} T)ig)$. Central to the approach are a quantum regression oracle, quantum fast-forward, and quantum Monte Carlo mean estimation, which together yield accelerated estimation of the surrogate parameters and efficient, staged uncertainty management. The method generalizes beyond kernels by allowing flexible surrogate families (linear, quadratic, or neural networks) and demonstrates superior performance over quantum baselines on synthetic benchmarks and real AutoML tasks. The results suggest that quantum-enhanced, parametric surrogate-based bandits can solve high-dimensional, black-box optimization problems more efficiently than prior RKHS-based quantum methods, with potential impact on hyperparameter tuning and drug discovery. The paper provides both rigorous regret guarantees and empirical validation to support these claims.

Abstract

We study non-linear bandit optimization where the learner maximizes a black-box function with zeroth order function oracle, which has been successfully applied in many critical applications such as drug discovery and hyperparameter tuning. Existing works have showed that with the aid of quantum computing, it is possible to break the $Ω(\sqrt{T})$ regret lower bound in classical settings and achieve the new $O(\mathrm{poly}\log T)$ upper bound. However, they usually assume that the objective function sits within the reproducing kernel Hilbert space and their algorithms suffer from the curse of dimensionality. In this paper, we propose the new Q-NLB-UCB algorithm which uses the novel parametric function approximation technique and enjoys performance improvement due to quantum fast-forward and quantum Monte Carlo mean estimation. We prove that the regret bound of Q-NLB-UCB is not only $O(\mathrm{poly}\log T)$ but also input dimension-free, making it applicable for high-dimensional tasks. At the heart of our analyses are a new quantum regression oracle and a careful construction of parameter uncertainty region. Our algorithm is also validated for its efficiency on both synthetic and real-world tasks.

Quantum Non-Linear Bandit Optimization

TL;DR

This work introduces Q-NLB-UCB, a quantum algorithm for non-linear bandit optimization that uses parametric function approximation to achieve an input-dimension-free regret bound of

. Central to the approach are a quantum regression oracle, quantum fast-forward, and quantum Monte Carlo mean estimation, which together yield accelerated estimation of the surrogate parameters and efficient, staged uncertainty management. The method generalizes beyond kernels by allowing flexible surrogate families (linear, quadratic, or neural networks) and demonstrates superior performance over quantum baselines on synthetic benchmarks and real AutoML tasks. The results suggest that quantum-enhanced, parametric surrogate-based bandits can solve high-dimensional, black-box optimization problems more efficiently than prior RKHS-based quantum methods, with potential impact on hyperparameter tuning and drug discovery. The paper provides both rigorous regret guarantees and empirical validation to support these claims.

Abstract

regret lower bound in classical settings and achieve the new

upper bound. However, they usually assume that the objective function sits within the reproducing kernel Hilbert space and their algorithms suffer from the curse of dimensionality. In this paper, we propose the new Q-NLB-UCB algorithm which uses the novel parametric function approximation technique and enjoys performance improvement due to quantum fast-forward and quantum Monte Carlo mean estimation. We prove that the regret bound of Q-NLB-UCB is not only

but also input dimension-free, making it applicable for high-dimensional tasks. At the heart of our analyses are a new quantum regression oracle and a careful construction of parameter uncertainty region. Our algorithm is also validated for its efficiency on both synthetic and real-world tasks.

Quantum Non-Linear Bandit Optimization

TL;DR

Abstract

Quantum Non-Linear Bandit Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (35)