Adiabatic Quantum Support Vector Machines
Prasanna Date, Dong Jun Woun, Kathleen Hamilton, Eduardo A. Coello Perez, Mayanka Chandra Shekhar, Francisco Rios, John Gounley, In-Saeng Suh, Travis Humble, Georgia Tourassi
TL;DR
This work tackles the computational burden of training support vector machines by recasting SVM training as a QUBO suitable for adiabatic quantum computing on the D-Wave Advantage. The authors' theoretical analysis gives $\mathcal{O}(N^2)$ time and space complexity at fixed precision, compared with $\mathcal{O}(N^3)$ for the classical baseline. Empirically, they report speedups of up to about $4.5\times$ on datasets with very many features, with accuracies on par with a Scikit-learn SVM on several benchmarks. The results indicate potential quantum advantages for ML training on near-term hardware, subject to overheads from embedding and hardware constraints, and the authors identify kernel extensions and noise mitigation as directions for future work.
Abstract
Adiabatic quantum computers can solve difficult optimization problems (e.g., the quadratic unconstrained binary optimization problem), and they seem well suited to train machine learning models. In this paper, we describe an adiabatic quantum approach for training support vector machines. We show that the time complexity of our quantum approach is an order of magnitude better than that of the classical approach. Next, we compare the test accuracy of our quantum approach against a classical approach that uses the Scikit-learn library in Python across five benchmark datasets (Iris, Wisconsin Breast Cancer (WBC), Wine, Digits, and Lambeq). We show that our quantum approach obtains accuracies on par with the classical approach. Finally, we perform a scalability study in which we compute the total training times of the quantum approach and the classical approach as the number of features and the number of data points in the training dataset increase. Our scalability results show that the quantum approach obtains a 3.5--4.5 times speedup over the classical approach on datasets with many (millions of) features.
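
To make the idea of training an SVM through a QUBO more concrete, the following is a minimal sketch, not the authors' exact formulation. It assumes the standard SVM dual objective, drops the equality constraint on the multipliers (which corresponds to a classifier through the origin, with no bias term), and encodes each Lagrange multiplier as a weighted sum of binary variables using a hypothetical precision vector `(0.25, 0.5)`. An exhaustive search stands in for the quantum annealer, so it only works for toy problems. The function names `svm_dual_qubo` and `brute_force_minimize` and the toy data are illustrative assumptions.

```python
import itertools
import numpy as np


def svm_dual_qubo(X, y, precision=(0.25, 0.5)):
    """Build Q such that b^T Q b = 0.5 * lam^T H lam - sum(lam), with lam = P b.

    H = (X X^T) * (y y^T) is the usual linear-kernel dual matrix.
    Assumption: the constraint sum_i lam_i * y_i = 0 is dropped (no bias term).
    """
    n = X.shape[0]
    k = len(precision)
    # P maps n*k binary variables to n multipliers: lam_i = sum_j p_j * b_{i,j}
    P = np.kron(np.eye(n), np.asarray(precision, dtype=float).reshape(1, k))
    H = (X @ X.T) * np.outer(y, y)
    Q = 0.5 * P.T @ H @ P
    # The linear term -1^T P b goes on the diagonal because b_j^2 == b_j.
    Q -= np.diag(P.T @ np.ones(n))
    return Q, P


def brute_force_minimize(Q):
    """Exhaustive stand-in for an annealer; feasible only for a few variables."""
    m = Q.shape[0]
    best_b, best_e = None, np.inf
    for bits in itertools.product((0, 1), repeat=m):
        b = np.array(bits, dtype=float)
        e = b @ Q @ b
        if e < best_e:
            best_b, best_e = b, e
    return best_b, best_e


# Toy, linearly separable data (illustrative only).
X = np.array([[1.0, 1.0], [2.0, 2.0], [-1.0, -1.0], [-2.0, -2.0]])
y = np.array([1.0, 1.0, -1.0, -1.0])

Q, P = svm_dual_qubo(X, y)
b, energy = brute_force_minimize(Q)
lam = P @ b                 # decoded Lagrange multipliers
w = (lam * y) @ X           # weight vector w = sum_i lam_i * y_i * x_i
predictions = np.sign(X @ w)
```

On real hardware the matrix `Q` would be passed to an annealer instead of `brute_force_minimize`, and the size and spacing of the precision vector would bound how finely the multipliers can be resolved.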
