Harnessing Hypergraphs in Geometric Deep Learning for 3D RNA Inverse Folding
Guang Yang, Lei Fan
TL;DR
This work tackles designing RNA sequences that realize target 3D structures by introducing HyperRNA, a hypergraph-based encoder–decoder framework. It preprocesses RNA–protein backbones with a 3-bead coarse-grained representation, applies a self-attention–augmented HGNN to capture high-order interactions, and autoregressively decodes sequences using geometric vector perceptrons. Training combines sequence and structure losses, while inference uses flow-based trajectory refinement to account for RNA flexibility. Across PDBBind and RNAsolo benchmarks, HyperRNA demonstrates superior structural accuracy and diverse, valid sequence design, highlighting the value of hypergraphs in RNA design. The results underscore a promising direction for RNA engineering that better accounts for multi-way nucleotide interactions and 3D geometry.
Abstract
The RNA inverse folding problem, a key challenge in RNA design, involves identifying nucleotide sequences that can fold into desired secondary structures, which are critical for ensuring molecular stability and function. The inherent complexity of this task stems from the intricate relationship between sequence and structure, making it particularly challenging. In this paper, we propose a framework, named HyperRNA, a generative model with an encoder-decoder architecture that leverages hypergraphs to design RNA sequences. Specifically, our HyperRNA model consists of three main components: preprocessing, encoding and decoding. In the preprocessing stage, graph structures are constructed by extracting the atom coordinates of RNA backbone based on 3-bead coarse-grained representation. The encoding stage processes these graphs, capturing higher order dependencies and complex biomolecular interactions using an attention embedding module and a hypergraph-based encoder. Finally, the decoding stage generates the RNA sequence in an autoregressive manner. We conducted quantitative and qualitative experiments on the PDBBind and RNAsolo datasets to evaluate the inverse folding task for RNA sequence generation and RNA-protein complex sequence generation. The experimental results demonstrate that HyperRNA not only outperforms existing RNA design methods but also highlights the potential of leveraging hypergraphs in RNA engineering.
