Table of Contents
Fetching ...

Probabilistic modeling over permutations using quantum computers

Vasilis Belis, Giulio Crognaletti, Matteo Argenton, Michele Grossi, Maria Schuld

Abstract

Quantum computers provide a super-exponential speedup for performing a Fourier transform over the symmetric group, an ability for which practical use cases have remained elusive so far. In this work, we leverage this ability to unlock spectral methods for machine learning over permutation-structured data, which appear in applications such as multi-object tracking and recommendation systems. It has been shown previously that a powerful way of building probabilistic models over permutations is to use the framework of non-Abelian harmonic analysis, as the model's group Fourier spectrum captures the interaction complexity: "low frequencies" correspond to low order correlations, and "high frequencies" to more complex ones. This can be used to construct a Markov chain model driven by alternating steps of diffusion (a group-equivariant convolution) and conditioning (a Bayesian update). However, this approach is computationally challenging and hence limited to simple approximations. Here we construct a quantum algorithm that encodes the exact probabilistic model -- a classically intractable object -- into the amplitudes of a quantum state by making use of the Quantum Fourier Transform (QFT) over the symmetric group. We discuss the scaling, limitations, and practical use of such an approach, which we envision to be a first step towards useful applications of non-Abelian QFTs.

Probabilistic modeling over permutations using quantum computers

Abstract

Quantum computers provide a super-exponential speedup for performing a Fourier transform over the symmetric group, an ability for which practical use cases have remained elusive so far. In this work, we leverage this ability to unlock spectral methods for machine learning over permutation-structured data, which appear in applications such as multi-object tracking and recommendation systems. It has been shown previously that a powerful way of building probabilistic models over permutations is to use the framework of non-Abelian harmonic analysis, as the model's group Fourier spectrum captures the interaction complexity: "low frequencies" correspond to low order correlations, and "high frequencies" to more complex ones. This can be used to construct a Markov chain model driven by alternating steps of diffusion (a group-equivariant convolution) and conditioning (a Bayesian update). However, this approach is computationally challenging and hence limited to simple approximations. Here we construct a quantum algorithm that encodes the exact probabilistic model -- a classically intractable object -- into the amplitudes of a quantum state by making use of the Quantum Fourier Transform (QFT) over the symmetric group. We discuss the scaling, limitations, and practical use of such an approach, which we envision to be a first step towards useful applications of non-Abelian QFTs.
Paper Structure (49 sections, 12 theorems, 176 equations, 4 figures)

This paper contains 49 sections, 12 theorems, 176 equations, 4 figures.

Key Result

Theorem 1

Every complex representation of a finite group is equivalent to a unitary representation.

Figures (4)

  • Figure 1: The probabilistic modeling process: The evolution of the belief state over permutations of $n$ objects is captured by the probability distribution $h^{(t)}(\sigma)$. Initialization ($t=0$) begins with a deterministic assignment (canonical configuration), visualized here for $n=3$ as $ABC \to[1,2,3]$. The process alternates between two operations: diffusion, which spreads probability mass (increasing entropy) to model the uncertainty in the system between steps; and conditioning, a Bayesian update that refines the distribution based on partial observations that are the training data. The process repeats for each observation in the training data.
  • Figure 2: Our quantum implementation: The probability distribution is encoded in the amplitudes of the quantum state $\ket{\psi^{(t)}}\propto\sum_\sigma h^{(t)}(\sigma)\ket{\sigma}$. We use Lehmer's code to map permutations $\sigma\in\mathbb{S}_n$ to $n!$ integers, which are then encoded to binary represented by computational basis states $\ket{\sigma}$. We implement diffusion in Fourier space by block-encoding an operator $\hat{q}$ (see Sec. \ref{['sec:diffusion:block_encoding']}). The conditioning step is implemented by a block-encoding of the operator $c_{\varphi}$ in direct space that exploits Lehmer's code. The Quantum Fourier Transform (QFT) over $\mathbb{S}_n$ efficiently toggles between Fourier and direct space. Each of these block-encodings requires a new ancilla. At the end, Amplitude Amplification boosts the probability that the output state encodes the full posterior probability distribution over permutations.
  • Figure 3: Scaling of the partitions of $n$ and size of $\mathbb{S}_n$. The number of partitions $p(n)$ is equal to the number of irreps (and conjugacy classes) of $\mathbb{S}_n$. While for big $n$, $p(n)$ scales super-polynomially according to the asymptotic formula in Eq. \ref{['app:eq:ramanujan']}, for moderate values of $n$$p(n)$ is significantly smaller than $n!$. As illustrated, it is possible to classically pre-compute all $c_\lambda$ ($p(n)$ values in total) for moderate $n$ since we have closed formulas, allowing for an efficient block-encoding through QROM.
  • Figure 4: Reorder-update approach: The figure summarizes the reorder-update approach focusing on the example of partial rankings with a hard likelihood. Using the same canonical ordering of Fig. \ref{['fig:markov']}, the observation A comes before C is an order $k=1$ ranking, comparing element $i_1=1$ and $i_2=3$. Conditioning is achieved reordering the elements by mapping $i_1\to 2$, i.e. choosing $\pi$ s.t. $c(\pi)_1 = 2$, computing $\varphi$ coherently, and finally restoring the canonical ordering. If a hard likelihood is used, the qubit storing $\ket{\varphi(\sigma)}$ can be directly used in the post-selection phase to implement $c_\varphi$, and no additional ancillary registers are required.

Theorems & Definitions (41)

  • Claim 1: Success probability at $t=0$
  • Claim 2: Success probability lower bounds
  • Claim 3: Cost of the reorder-update approach
  • Claim 4: Success probability
  • Claim 5: Block-Encoding of Posterior Amplitudes
  • proof
  • Definition 1: Group
  • Definition 2: Group representation
  • Theorem 1
  • Definition 3: Subrepresentation
  • ...and 31 more