A Quantum Framework for Protein Binding-Site Structure Prediction on Utility-Level Quantum Processors
Yuqi Zhang, Yuxin Yang, William Martin, Kingsten Lin, Zixu Wang, Cheng-Chang Lu, Weiwen Jiang, Ruth Nussinov, Joseph Loscalzo, Qiang Guan, Feixiong Cheng
TL;DR
The work tackles local protein binding-site structure prediction for short, flexible fragments, a regime where classical and DL methods struggle. It introduces an end-to-end quantum framework that maps sequences to a tetrahedral backbone lattice and encodes constraints into a sparse Pauli Hamiltonian, optimized via Variational Quantum Eigensolver (VQE) with an EfficientSU2 ansatz. A two-stage execution strategy mitigates noise by separating energy estimation from measurement decoding, enabling hardware-executable predictions on utility-level quantum processors. Hardware experiments on IBM’s 127-qubit processor show quantum predictions with lower RMSD and stronger docking affinities than AlphaFold3 and classical baselines, highlighting practical viability and a path toward quantum-assisted docking and design workflows. The results validate a concrete, hardware-feasible quantum pipeline for structural biology with potential impact on drug discovery and protein engineering.
Abstract
Accurate prediction of protein active-site structures remains a central challenge in structural biology, particularly for short and flexible peptide fragments where conventional and simulation-based methods often fail. Here, we present a quantum computing framework specifically developed for utility-level quantum processors to address this problem. Starting from an amino acid sequence, we formulate structure prediction as a ground-state energy minimization problem using the Variational Quantum Eigensolver (VQE). Amino acid connectivity is encoded on a tetrahedral lattice model, and structural constraints-including steric, geometric, and chirality terms-are mapped into a problem-specific Hamiltonian represented as sparse Pauli operators. Optimization is performed with a two-stage architecture that separates energy estimation from measurement decoding, enabling noise mitigation under realistic device conditions. We evaluate the framework on 23 randomly selected protein fragments from the PDBbind dataset and 7 fragments from therapeutically relevant proteins, and execute experiments on the IBM-Cleveland Clinic quantum processor. Predictions are benchmarked against AlphaFold 3 (AF3) and classical simulation-based approaches using identical postprocessing and docking procedures. Our method outperforms both AF3 and classical baselines in RMSD (root-mean-square deviation) and docking efficacy. These results demonstrate an end-to-end, hardware-executable pipeline for biologically relevant structure prediction on real quantum processors, highlighting its engineering feasibility and practical advantages over existing classical and deep learning approaches.
