GroverGPT: A Large Language Model with 8 Billion Parameters for Quantum Searching
Haoran Wang, Pingzhi Li, Min Chen, Jinglei Cheng, Junyu Liu, Tianlong Chen
TL;DR
The paper addresses the feasibility of classically simulating quantum algorithms by using task-specific LLMs to emulate Grover's search. It introduces GroverGPT, an 8B-parameter LLaMA-based model trained on 15T+ tokens with multimodal inputs (quantum circuits, QASM, and natural language) to output probability distributions that mimic quantum amplitudes without explicit state representations, achieving superior accuracy to GPT-4o on quantum-search tasks. The study shows robust generalization from training on small qubits to larger systems (up to $20$ qubits) and demonstrates that GroverGPT captures genuine quantum features, not just classical patterns, though accuracy degrades with increasing system size. It highlights the importance of prompting strategies (QASM + conversation) and data diversity for robustness and suggests that task-specific LLMs can advance quantum algorithm research, while outlining future work on noisy devices, larger qubit counts, and quantum error-correction modeling.
Abstract
Quantum computing is an exciting non-Von Neumann paradigm, offering provable speedups over classical computing for specific problems. However, the practical limits of classical simulatability for quantum circuits remain unclear, especially with current noisy quantum devices. In this work, we explore the potential of leveraging Large Language Models (LLMs) to simulate the output of a quantum Turing machine using Grover's quantum circuits, known to provide quadratic speedups over classical counterparts. To this end, we developed GroverGPT, a specialized model based on LLaMA's 8-billion-parameter architecture, trained on over 15 trillion tokens. Unlike brute-force state-vector simulations, which demand substantial computational resources, GroverGPT employs pattern recognition to approximate quantum search algorithms without explicitly representing quantum states. Analyzing 97K quantum search instances, GroverGPT consistently outperformed OpenAI's GPT-4o (45\% accuracy), achieving nearly 100\% accuracy on 6- and 10-qubit datasets when trained on 4-qubit or larger datasets. It also demonstrated strong generalization, surpassing 95\% accuracy for systems with over 20 qubits when trained on 3- to 6-qubit data. Analysis indicates GroverGPT captures quantum features of Grover's search rather than classical patterns, supported by novel prompting strategies to enhance performance. Although accuracy declines with increasing system size, these findings offer insights into the practical boundaries of classical simulatability. This work suggests task-specific LLMs can surpass general-purpose models like GPT-4o in quantum algorithm learning and serve as powerful tools for advancing quantum research.
