Benchmarking Quantum Kernels Across Diverse and Complex Data
Yuhan Jiang, Matthew Otten
TL;DR
This work tackles the practical viability of quantum kernels on high-dimensional, real-world data by introducing a resource-efficient variational quantum kernel framework that uses two encoding schemes and a trainable ansatz with a parameter-scaling technique. It systematically benchmarks eight diverse datasets across tabular, image, time series, and graph domains, showing that correctly designed quantum kernels can surpass standard classical kernels in many cases within constrained quantum resources. The study demonstrates the benefits of amplitude- and truncated-RBF encodings, and validates that scaling the ansatz accelerates convergence and improves final accuracy, with additional gains observed as qubit resources increase. While conducted in classical simulation, the results provide a solid foundation for quantum-kernel methods in real-world ML pipelines and point to future hardware-based evaluations and more quantum-native feature-map designs.
Abstract
Quantum kernel methods are a promising branch of quantum machine learning, yet their practical advantage on diverse, high-dimensional, real-world data remains unverified. Current research has largely been limited to low-dimensional or synthetic datasets, preventing a thorough evaluation of their potential. To address this gap, we developed a variational quantum kernel framework utilizing resource-efficient ansätze for complex classification tasks and introduced a parameter scaling technique to accelerate convergence. We conducted a comprehensive benchmark of this framework on eight challenging, real world and high-dimensional datasets covering tabular, image, time series, and graph data. Our classically simulated results show that the proposed quantum kernel demonstrated a clear performance advantage over standard classical kernels, such as the radial basis function (RBF) kernel. This work demonstrates that properly designed quantum kernels can function as versatile, high-performance tools, laying a foundation for quantum-enhanced applications in real-world machine learning. Further research is needed to fully assess the practical quantum advantage.
