WSBD: Freezing-Based Optimizer for Quantum Neural Networks
Christopher Kverne, Mayur Akewar, Yuqian Huo, Tirthak Patel, Janki Bhimani
TL;DR
This work tackles the high cost of gradient estimation and barren plateaus in quantum neural network (QNN) training by introducing Weighted Stochastic Block Descent (WSBD), a dynamic, parameter-wise freezing optimizer. WSBD computes a gradient-based importance score, then stochastically freezes less influential parameters in training windows to reduce forward passes while preserving full expressivity; scores are reset when parameters re-enter the active set. The authors provide a formal convergence proof and demonstrate substantial, scalable efficiency gains across MNIST, parity, and VQE tasks, with robustness to hardware noise. Ablation studies highlight the importance of stochastic freezing, granular parameter-wise decisions, and adaptive score resets. The approach yields practical speedups and identifies a principled direction for hardware-aware optimization in QML.
Abstract
The training of Quantum Neural Networks (QNNs) is hindered by the high computational cost of gradient estimation and the barren plateau problem, where optimization landscapes become intractably flat. To address these challenges, we introduce Weighted Stochastic Block Descent (WSBD), a novel optimizer with a dynamic, parameter-wise freezing strategy. WSBD intelligently focuses computational resources by identifying and temporarily freezing less influential parameters based on a gradient-derived importance score. This approach significantly reduces the number of forward passes required per training step and helps navigate the optimization landscape more effectively. Unlike pruning or layer-wise freezing, WSBD maintains full expressive capacity while adapting throughout training. Our extensive evaluation shows that WSBD converges on average 63.9% faster than Adam for the popular ground-state-energy problem, an advantage that grows with QNN size. We provide a formal convergence proof for WSBD and show that parameter-wise freezing outperforms traditional layer-wise approaches in QNNs. Project page: https://github.com/Damrl-lab/WSBD-Stochastic-Freezing-Optimizer.
