Table of Contents
Fetching ...

Dataset Distillation for Quantum Neural Networks

Koustubh Phalak, Junde Li, Swaroop Ghosh

TL;DR

The paper tackles the high cost of training quantum neural networks (QNNs) on large classical datasets by applying dataset distillation to a quantum LeNet model. It introduces a quantum-classical LeNet variant with residual connections and a trainable Hermitian observable in the parametric quantum circuit, enabling a compact synthetic dataset that preserves inference performance. Empirical results show post-distillation accuracy of 91.9% on MNIST and 50.3% on CIFAR-10 for the quantum model, closely approaching the classical LeNet baselines of 94% and 54%, with stability improvements from a non-trainable Hermitian. The work highlights practical trade-offs and sets the stage for enhanced quantum feature extraction and more robust distillation methods in QNNs.

Abstract

Training Quantum Neural Networks (QNNs) on large amount of classical data can be both time consuming as well as expensive. Higher amount of training data would require higher number of gradient descent steps to reach convergence. This, in turn would imply that the QNN will require higher number of quantum executions, thereby driving up its overall execution cost. In this work, we propose performing the dataset distillation process for QNNs, where we use a novel quantum variant of classical LeNet model containing residual connection and trainable Hermitian observable in the Parametric Quantum Circuit (PQC) of the QNN. This approach yields highly informative yet small number of training data at similar performance as the original data. We perform distillation for MNIST and Cifar-10 datasets, and on comparison with classical models observe that both the datasets yield reasonably similar post-inferencing accuracy on quantum LeNet (91.9% MNIST, 50.3% Cifar-10) compared to classical LeNet (94% MNIST, 54% Cifar-10). We also introduce a non-trainable Hermitian for ensuring stability in the distillation process and note marginal reduction of up to 1.8% (1.3%) for MNIST (Cifar-10) dataset.

Dataset Distillation for Quantum Neural Networks

TL;DR

The paper tackles the high cost of training quantum neural networks (QNNs) on large classical datasets by applying dataset distillation to a quantum LeNet model. It introduces a quantum-classical LeNet variant with residual connections and a trainable Hermitian observable in the parametric quantum circuit, enabling a compact synthetic dataset that preserves inference performance. Empirical results show post-distillation accuracy of 91.9% on MNIST and 50.3% on CIFAR-10 for the quantum model, closely approaching the classical LeNet baselines of 94% and 54%, with stability improvements from a non-trainable Hermitian. The work highlights practical trade-offs and sets the stage for enhanced quantum feature extraction and more robust distillation methods in QNNs.

Abstract

Training Quantum Neural Networks (QNNs) on large amount of classical data can be both time consuming as well as expensive. Higher amount of training data would require higher number of gradient descent steps to reach convergence. This, in turn would imply that the QNN will require higher number of quantum executions, thereby driving up its overall execution cost. In this work, we propose performing the dataset distillation process for QNNs, where we use a novel quantum variant of classical LeNet model containing residual connection and trainable Hermitian observable in the Parametric Quantum Circuit (PQC) of the QNN. This approach yields highly informative yet small number of training data at similar performance as the original data. We perform distillation for MNIST and Cifar-10 datasets, and on comparison with classical models observe that both the datasets yield reasonably similar post-inferencing accuracy on quantum LeNet (91.9% MNIST, 50.3% Cifar-10) compared to classical LeNet (94% MNIST, 54% Cifar-10). We also introduce a non-trainable Hermitian for ensuring stability in the distillation process and note marginal reduction of up to 1.8% (1.3%) for MNIST (Cifar-10) dataset.

Paper Structure

This paper contains 6 sections, 1 equation, 4 figures, 2 tables.

Figures (4)

  • Figure 1: Performance matching dataset distillation process. The goal here is to distill large original training dataset into much smaller synthetic dataset such that when the same model is trained on each of the two datasets in different scenarios, the inferencing performance on the common testing set is nearly the same.
  • Figure 2: Proposed novel quantum variant of classical LeNet architecture. The convolution layers for feature extraction are kept the same but QNN layer along with residual connection is added in the classifier part between the fully connected dense layers.
  • Figure 3: QNN layer architecture containing 1. amplitude embedding, 2. PQC having strongly entangling layers schuld2020circuit and 3. measurement with Hermitian observable.
  • Figure 4: Distilled MNIST and Cifar-10 images on LeNet and Quantum LeNet models.