On Optimizing Hyperparameters for Quantum Neural Networks

Sabrina Herbst; Vincenzo De Maio; Ivona Brandic

On Optimizing Hyperparameters for Quantum Neural Networks

Sabrina Herbst, Vincenzo De Maio, Ivona Brandic

TL;DR

This work tackles the challenge of hyperparameter tuning in quantum neural networks (QNNs) by conducting an extensive empirical study across four classical datasets using IBM Qiskit. It systematically varies data encoding, optimizers, ansatz structures, entanglement patterns, and preprocessing, under both noiseless and noisy simulations. The key finding is that optimizer choice and initialization dominate trainability, with beta-distributed initialization often improving performance, especially for entangled feature maps, while entanglement strategy shows limited consistent impact. The study provides concrete guidelines and publicly available data to help researchers and practitioners design more trainable QNNs for near-term hardware, signaling practical pathways for accelerating quantum-assisted ML in the Post-MMoore era.

Abstract

The increasing capabilities of Machine Learning (ML) models go hand in hand with an immense amount of data and computational power required for training. Therefore, training is usually outsourced into HPC facilities, where we have started to experience limits in scaling conventional HPC hardware, as theorized by Moore's law. Despite heavy parallelization and optimization efforts, current state-of-the-art ML models require weeks for training, which is associated with an enormous $CO_2$ footprint. Quantum Computing, and specifically Quantum Machine Learning (QML), can offer significant theoretical speed-ups and enhanced expressive power. However, training QML models requires tuning various hyperparameters, which is a nontrivial task and suboptimal choices can highly affect the trainability and performance of the models. In this study, we identify the most impactful hyperparameters and collect data about the performance of QML models. We compare different configurations and provide researchers with performance data and concrete suggestions for hyperparameter selection.

On Optimizing Hyperparameters for Quantum Neural Networks

TL;DR

Abstract

footprint. Quantum Computing, and specifically Quantum Machine Learning (QML), can offer significant theoretical speed-ups and enhanced expressive power. However, training QML models requires tuning various hyperparameters, which is a nontrivial task and suboptimal choices can highly affect the trainability and performance of the models. In this study, we identify the most impactful hyperparameters and collect data about the performance of QML models. We compare different configurations and provide researchers with performance data and concrete suggestions for hyperparameter selection.

Paper Structure (28 sections, 7 equations, 11 figures, 10 tables)

This paper contains 28 sections, 7 equations, 11 figures, 10 tables.

Introduction
Background
Quantum Information Theory
Quantum Machine Learning
Related Work
Methodology
Data
Optimizers
Ansatzes
Circuit Structure
Qubit Entanglement
Encoding
Experimental Setup
Hardware Specifications
Experiments
...and 13 more sections

Figures (11)

Figure 1: Quantum Neural Networks
Figure 2: RealAmplitudes Ansatz
Figure 3: Cover Type: Absolute Difference in Accuracy
Figure 4: Cover Type, Noiseless: Initialization vs. Accuracy
Figure 5: KDD Cup: Nelder-Mead Convergence
...and 6 more figures

On Optimizing Hyperparameters for Quantum Neural Networks

TL;DR

Abstract

On Optimizing Hyperparameters for Quantum Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Figures (11)