Stochastic optimal control of open quantum systems
Aarón Villanueva, Hilbert Kappen
TL;DR
This work recasts open-quantum-system state preparation as a stochastic optimal control problem and solves it via path integral control, obviating gradient computations. By leveraging SSE unravelings and a gauge-invariant transformation to anti-Hermitian operators, the authors derive closed-form, trajectory-based expressions for optimal pulses and implement adaptive importance sampling (QDC) to estimate them efficiently. They extend QDC to function as a diffusion-based annealer to tackle unitary dynamics and demonstrate strong performance on single-qubit, multi-qubit, and NMR-inspired GHZ tasks, with notable fidelity gains and reduced variance compared to opensource gradient-based baselines. The framework is inherently parallelizable and opens a practical path toward hardware-aware quantum control, including on-hardware implementations and pulse-level control. Overall, QDC offers a scalable, gradient-free approach for optimal control in noisy quantum environments, with broad implications for quantum information processing and quantum technologies.
Abstract
We address the generic problem of optimal quantum state preparation for open quantum systems. It is well known that open quantum systems can be simulated by quantum trajectories described by a stochastic Schrödinger equation. In this context, the state preparation becomes a stochastic optimal control (SOC) problem. The latter requires the solution of the Hamilton-Jacobi-Bellman equation, which is, in general, challenging to solve. A notable exception are the so-called path integral (PI) control problems, for which one can estimate the optimal control solution by direct sampling of the cost objective. In this work, we derive a class of quantum state preparation problems that are amenable to PI control techniques, and propose a corresponding algorithm, which we call Quantum Diffusion Control (QDC). Unlike conventional quantum control algorithms, QDC avoids computing gradients of the cost function to determine the optimal control. Instead, it employs adaptive importance sampling, a technique where the controls are iteratively improved based on global averages over quantum trajectories. We also demonstrate that QDC, used as an annealer in the environmental coupling strength, finds high accuracy solutions for unitary (noiseless) quantum control problems. We further discuss the implementation of this technique on quantum hardware. We illustrate the effectiveness of our approach through examples of open-loop control for single- and multi-qubit systems.
