Table of Contents
Fetching ...

Quantum Probabilistic Label Refining: Enhancing Label Quality for Robust Image Classification

Fang Qi, Lu Peng, Zhengming Ding

TL;DR

The paper addresses the problem that one-hot labels induce overconfidence and brittleness under distribution shifts in image classification. It proposes Quantum Probabilistic Label Refining (QPLR), a hybrid quantum–classical approach that uses a variational quantum circuit to generate input-specific probabilistic labels via quantum non-determinism and the Born rule, which are used to train a classical CNN with soft-label targets. The key contributions include (i) leveraging angle or amplitude encoding and multi-layer VQCs to produce expressive, sample-dependent label distributions, (ii) demonstrating up to ~50% gains in robustness to noise and rotations on MNIST and Fashion-MNIST, and (iii) showing improved calibration and interpretability through human and foundation-model assessments of uncertainty. The work demonstrates that offline quantum labeling can provide architecture-agnostic improvements to classical models and offers a path toward uncertainty-aware, robust image classification with potential impact on safety-critical applications. It also discusses scalability challenges and future directions toward hardware deployment and batched quantum execution for larger-scale use cases.

Abstract

Learning with softmax cross-entropy on one-hot labels often leads to overconfident predictions and poor robustness under noise or perturbations. Label smoothing mitigates this by redistributing some confidence uniformly, but treats all samples equally, ignoring intra-class variability. We propose a hybrid quantum-classical framework that leverages quantum non-determinism to refine data labels into probabilistic ones, offering more nuanced, human-like uncertainty representations than label smoothing or Bayesian approaches. A variational quantum circuit (VQC) encodes inputs into multi-qubit quantum states, using entanglement and superposition to capture subtle feature correlations. Measurement via the Born rule extracts probabilistic soft labels that reflect input-specific uncertainty. These labels are then used to train a classical convolutional neural network (CNN) with soft-target cross-entropy loss. On MNIST and Fashion-MNIST, our method improves robustness, achieving up to 50% higher accuracy under noise while maintaining competitive accuracy on clean data. It also enhances model calibration and interpretability, as CNN outputs better reflect quantum-derived uncertainty. This work introduces Quantum Probabilistic Label Refining, bridging quantum measurement and classical deep learning for robust training via refined, correlation-aware labels without architectural changes or adversarial techniques.

Quantum Probabilistic Label Refining: Enhancing Label Quality for Robust Image Classification

TL;DR

The paper addresses the problem that one-hot labels induce overconfidence and brittleness under distribution shifts in image classification. It proposes Quantum Probabilistic Label Refining (QPLR), a hybrid quantum–classical approach that uses a variational quantum circuit to generate input-specific probabilistic labels via quantum non-determinism and the Born rule, which are used to train a classical CNN with soft-label targets. The key contributions include (i) leveraging angle or amplitude encoding and multi-layer VQCs to produce expressive, sample-dependent label distributions, (ii) demonstrating up to ~50% gains in robustness to noise and rotations on MNIST and Fashion-MNIST, and (iii) showing improved calibration and interpretability through human and foundation-model assessments of uncertainty. The work demonstrates that offline quantum labeling can provide architecture-agnostic improvements to classical models and offers a path toward uncertainty-aware, robust image classification with potential impact on safety-critical applications. It also discusses scalability challenges and future directions toward hardware deployment and batched quantum execution for larger-scale use cases.

Abstract

Learning with softmax cross-entropy on one-hot labels often leads to overconfident predictions and poor robustness under noise or perturbations. Label smoothing mitigates this by redistributing some confidence uniformly, but treats all samples equally, ignoring intra-class variability. We propose a hybrid quantum-classical framework that leverages quantum non-determinism to refine data labels into probabilistic ones, offering more nuanced, human-like uncertainty representations than label smoothing or Bayesian approaches. A variational quantum circuit (VQC) encodes inputs into multi-qubit quantum states, using entanglement and superposition to capture subtle feature correlations. Measurement via the Born rule extracts probabilistic soft labels that reflect input-specific uncertainty. These labels are then used to train a classical convolutional neural network (CNN) with soft-target cross-entropy loss. On MNIST and Fashion-MNIST, our method improves robustness, achieving up to 50% higher accuracy under noise while maintaining competitive accuracy on clean data. It also enhances model calibration and interpretability, as CNN outputs better reflect quantum-derived uncertainty. This work introduces Quantum Probabilistic Label Refining, bridging quantum measurement and classical deep learning for robust training via refined, correlation-aware labels without architectural changes or adversarial techniques.

Paper Structure

This paper contains 26 sections, 16 equations, 5 figures, 6 tables.

Figures (5)

  • Figure 1: Conceptual overview of our QPLR approach, where an input digit is encoded into a variational quantum circuit and outputs a probabilistic label distribution over the $K=10$ classes. This soft label supervises a downstream DNN to capture more realistic crowd-source knowledge.
  • Figure 2: An illustrative example of generating a soft Quantum labeling. (a) An ambiguous MNIST digit “5”. (b) A simple 4-qubit variational quantum circuit composed of state embedding, one layer of parametrized circuit, and measurement. (c) The soft label contains distributions that provide both 3 and 5 as the predictions with higher probability than other digits.
  • Figure 3: Comparison of human, foundation AI model, and quantum probabilistic predictions on ambiguous MNIST train examples. For each digit image, we report the digit recognition assigned by human annotators, ChatGPT-4, ChatGPT-4o, and Gemini 2.0 Flash, and the top 2 classes within the VQC's distribution. These results illustrate differing perceptions of uncertainty across models and humans.[Best viewed in color.]
  • Figure 4: Confusion matrices where test samples are perturbed with Gaussian noise (mean = 0, standard deviation = 0.25) and rotated by 30 degrees: (a) M1, (b) M2, (c) M3.
  • Figure 5: Samples of Wrong predictions from our M3 with all original test samples, while M1 and M2 generate predictions the same as the given labels with near-1 confidence.