AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
Xinbiao Wang, Yuxuan Du, Zihan Lou, Yang Qian, Kaining Zhang, Yong Luo, Bo Du, Dacheng Tao
TL;DR
AiDE-Q addresses the practicality gap in DL-based quantum property estimation by iteratively generating high-quality synthetic labels from a hybrid dataset with limited measurements. It uses a consistency-check to filter synthetic labels and updates the DL model across iterations, achieving up to $14.2\%$ improvement on ground-state properties in Heisenberg XXZ, cluster-Ising, and molecular H$_4$ systems up to $50$ qubits. The framework is compatible with supervised, semi-supervised, and self-supervised paradigms and demonstrates that a basic SL model with AiDE-Q can outperform more complex baselines. This work suggests synthetic data, when quality-filtered, can meaningfully extend DL utility for quantum property estimation when hardware resources are scarce.
Abstract
Quantum many-body problems are central to various scientific disciplines, yet their ground-state properties are intrinsically challenging to estimate. Recent advances in deep learning (DL) offer potential solutions in this field, complementing prior purely classical and quantum approaches. However, existing DL-based models typically assume access to a large-scale and noiseless labeled dataset collected by infinite sampling. This idealization raises fundamental concerns about their practical utility, especially given the limited availability of quantum hardware in the near term. To unleash the power of these DL-based models, we propose AiDE-Q (\underline{a}utomat\underline{i}c \underline{d}ata \underline{e}ngine for \underline{q}uantum property estimation), an effective framework that addresses this challenge by iteratively generating high-quality synthetic labeled datasets. Specifically, AiDE-Q utilizes a consistency-check method to assess the quality of synthetic labels and continuously improves the employed DL models with the identified high-quality synthetic dataset. To verify the effectiveness of AiDE-Q, we conduct extensive numerical simulations on a diverse set of quantum many-body and molecular systems, with up to 50 qubits. The results show that AiDE-Q enhances prediction performance for various reference learning models, with improvements of up to $14.2\%$. Moreover, we exhibit that a basic supervised learning model integrated with AiDE-Q outperforms advanced reference models, highlighting the importance of a synthetic dataset. Our work paves the way for more efficient and practical applications of DL for quantum property estimation.
