Table of Contents
Fetching ...

Spectral Bias in Variational Quantum Machine Learning

Callum Duffy, Marcin Jastrzebski

TL;DR

This work provides a rigorous link between spectral bias in PQCs and the redundancy of Fourier coefficients induced by data encodings. It presents a Fourier-based framework to bound gradients at each frequency and shows experimentally that higher redundancy accelerates learning of those frequencies, while initialization and entanglement modulate the effect. The findings suggest practical circuit-design strategies to tailor the spectrum for specific tasks and offer a robustness perspective tied to redundancy. While demonstrated on synthetic tasks with single-qubit encodings, the approach points to broader applicability in guiding PQC architectures for improved high-frequency generalization.

Abstract

In this work, we investigate the phenomenon of spectral bias in quantum machine learning, where, in classical settings, models tend to fit low-frequency components of a target function earlier during training than high-frequency ones, demonstrating a frequency-dependent rate of convergence. We study this effect specifically in parameterised quantum circuits (PQCs). Leveraging the established formulation of PQCs as Fourier series, we prove that spectral bias in this setting arises from the ``redundancy'' of the Fourier coefficients, which denotes the number of terms in the analytical form of the model contributing to the same frequency component. The choice of data encoding scheme dictates the degree of redundancy for a Fourier coefficient. We find that the magnitude of the Fourier coefficients' gradients during training strongly correlates with the coefficients' redundancy. We then further demonstrate this empirically with three different encoding schemes. Additionally, we demonstrate that PQCs with greater redundancy exhibit increased robustness to random perturbations in their parameters at the corresponding frequencies. We investigate how design choices affect the ability of PQCs to learn Fourier sums, focusing on parameter initialization scale and entanglement structure, finding large initializations and low-entanglement schemes tend to slow convergence.

Spectral Bias in Variational Quantum Machine Learning

TL;DR

This work provides a rigorous link between spectral bias in PQCs and the redundancy of Fourier coefficients induced by data encodings. It presents a Fourier-based framework to bound gradients at each frequency and shows experimentally that higher redundancy accelerates learning of those frequencies, while initialization and entanglement modulate the effect. The findings suggest practical circuit-design strategies to tailor the spectrum for specific tasks and offer a robustness perspective tied to redundancy. While demonstrated on synthetic tasks with single-qubit encodings, the approach points to broader applicability in guiding PQC architectures for improved high-frequency generalization.

Abstract

In this work, we investigate the phenomenon of spectral bias in quantum machine learning, where, in classical settings, models tend to fit low-frequency components of a target function earlier during training than high-frequency ones, demonstrating a frequency-dependent rate of convergence. We study this effect specifically in parameterised quantum circuits (PQCs). Leveraging the established formulation of PQCs as Fourier series, we prove that spectral bias in this setting arises from the ``redundancy'' of the Fourier coefficients, which denotes the number of terms in the analytical form of the model contributing to the same frequency component. The choice of data encoding scheme dictates the degree of redundancy for a Fourier coefficient. We find that the magnitude of the Fourier coefficients' gradients during training strongly correlates with the coefficients' redundancy. We then further demonstrate this empirically with three different encoding schemes. Additionally, we demonstrate that PQCs with greater redundancy exhibit increased robustness to random perturbations in their parameters at the corresponding frequencies. We investigate how design choices affect the ability of PQCs to learn Fourier sums, focusing on parameter initialization scale and entanglement structure, finding large initializations and low-entanglement schemes tend to slow convergence.

Paper Structure

This paper contains 23 sections, 5 theorems, 98 equations, 10 figures.

Key Result

Theorem 1

Upper bound on the gradient of the loss at frequency $\omega$ for integer-frequency models Let $f(x,\theta)$ denote the output of a PQC with an integer-valued spectrum, trained to minimise the mean squared loss with respect to a target function $h(x)$, both of which can be expressed as Fourier serie where $||O||_{\text{tr}}$ is the trace norm of $O$, which, for Hermitian matrices, is the sum of ab

Figures (10)

  • Figure 1: General reuploader circuit design with trainable gates $W(\theta)$ and data encoding gates $S(x)$.
  • Figure 2: The specific reuploader model used for the experiments, containing trainable parameters $\theta$ and data $x$. The choice of coefficients $\beta$ determines the nature of the encoding and $L$ is the number of circuit layers.
  • Figure 3: The Fourier spectra of the two Pauli encoding schemes constant and ternary with empirical data taken as the sampled mean over ten models. Depicted are the sampled theoretically accessible frequencies (light red), the mean Fourier coefficient (blue), and the total gradient of trainable parameters at each Fourier coefficient (green).
  • Figure 4: The rate at which frequencies (x-axis) are learnt during the course of training (y-axis), the colorbar measures the PQC spectrum normalised by the target amplitude at a given frequency ($|\tilde{f}_\omega|/A_i$). Each subplot depicts the training dynamics for a different encoding scheme.
  • Figure 5: Normalised Fourier spectrum of the model output (x-axis: frequency, colourbar: magnitude) as a function of parameter perturbation (y-axis). Each subplot depicts the effects of parameter perturbations for a different encoding scheme.
  • ...and 5 more figures

Theorems & Definitions (10)

  • Theorem 1
  • Theorem 2
  • Theorem 3
  • Theorem 4
  • Theorem 5: Upper bound on the robustness of PQCs to isotropic parameter perturbations
  • proof
  • proof
  • proof
  • proof
  • proof