Low-latency machine learning FPGA accelerator for multi-qubit-state discrimination
Pradeep Kumar Gautam, Shantharam Kalipatnapu, Shankaranarayanan H, Ujjawal Singhal, Benjamin Lienhard, Vibhor Singh, Chetan Singh Thakur
TL;DR
The paper tackles the challenge of ultra-low-latency, high-fidelity multi-qubit state discrimination for frequency-multiplexed readout on an FPGA platform. It develops an end-to-end NN-based discriminator trained with quantization-aware training and deployed via the FINN-R dataflow flow to a RFSoC ZCU111, achieving latencies below 50 ns for five qubits while preserving fidelity ($F_{ m GM}\approx 0.90$). It demonstrates architectural strategies to maximize FPGA parallelism (input segmentation, Concat-free dataflow) and compares with SVM/matched-filter baselines, showing favorable latency with a larger parameter budget. The work provides a scalable, automated path to integrate NN-based readout into quantum control stacks, with relevance for improving QEC cycle rates and robustness to readout crossticks.
Abstract
Measuring a qubit state is a fundamental yet error-prone operation in quantum computing. These errors can arise from various sources, such as crosstalk, spontaneous state transitions, and excitations caused by the readout pulse. Here, we utilize an integrated approach to deploy neural networks onto field-programmable gate arrays (FPGA). We demonstrate that implementing a fully connected neural network accelerator for multi-qubit readout is advantageous, balancing computational complexity with low latency requirements without significant loss in accuracy. The neural network is implemented by quantizing weights, activation functions, and inputs. The hardware accelerator performs frequency-multiplexed readout of five superconducting qubits in less than 50 ns on a radio frequency system on chip (RFSoC) ZCU111 FPGA, marking the advent of RFSoC-based low-latency multi-qubit readout using neural networks. These modules can be implemented and integrated into existing quantum control and readout platforms, making the RFSoC ZCU111 ready for experimental deployment.
