Protein Structure Prediction in the 3D HP Model Using Deep Reinforcement Learning
Giovanny Espitia, Yui Tik Pang, James C. Gumbart
TL;DR
The study addresses protein structure prediction in the 3D HP lattice model, reframing folding as energy minimization via hydrophobic contacts with $E = -\big(\text{number of valid H-H contacts}\big)$. It introduces two DRL architectures—a reservoir-based hybrid (FFNN-R) and an LSTM with multi-head attention (LSTM-A)—trained under a stabilized Deep Q-Learning framework. For short sequences, FFNN-R delivers faster convergence with ~25% fewer episodes, while for longer sequences, LSTM-A captures long-range dependencies and achieves best-known values, albeit with higher compute and memory demands. The results highlight complementary strengths: efficient local pattern learning by FFNN-R and robust long-range modeling by LSTM-A, suggesting fruitful directions for hybrid designs and scalable protein-folding strategies in lattice models.
Abstract
We address protein structure prediction in the 3D Hydrophobic-Polar lattice model through two novel deep learning architectures. For proteins under 36 residues, our hybrid reservoir-based model combines fixed random projections with trainable deep layers, achieving optimal conformations with 25% fewer training episodes. For longer sequences, we employ a long short-term memory network with multi-headed attention, matching best-known energy values. Both architectures leverage a stabilized Deep Q-Learning framework with experience replay and target networks, demonstrating consistent achievement of optimal conformations while significantly improving training efficiency compared to existing methods.
