Table of Contents
Fetching ...

SGNet: Folding Symmetrical Protein Complex with Deep Learning

Zhaoqun Li, Jingcheng Yu, Qiwei Ye

TL;DR

A protein folding framework called SGNet is proposed to model protein-protein interactions in symmetrical assemblies and can model all global symmetry types in quaternary protein structure prediction.

Abstract

Deep learning has made significant progress in protein structure prediction, advancing the development of computational biology. However, despite the high accuracy achieved in predicting single-chain structures, a significant number of large homo-oligomeric assemblies exhibit internal symmetry, posing a major challenge in structure determination. The performances of existing deep learning methods are limited since the symmetrical protein assembly usually has a long sequence, making structural computation infeasible. In addition, multiple identical subunits in symmetrical protein complex cause the issue of supervision ambiguity in label assignment, requiring a consistent structure modeling for the training. To tackle these problems, we propose a protein folding framework called SGNet to model protein-protein interactions in symmetrical assemblies. SGNet conducts feature extraction on a single subunit and generates the whole assembly using our proposed symmetry module, which largely mitigates computational problems caused by sequence length. Thanks to the elaborate design of modeling symmetry consistently, we can model all global symmetry types in quaternary protein structure prediction. Extensive experimental results on a benchmark of symmetrical protein complexes further demonstrate the effectiveness of our method.

SGNet: Folding Symmetrical Protein Complex with Deep Learning

TL;DR

A protein folding framework called SGNet is proposed to model protein-protein interactions in symmetrical assemblies and can model all global symmetry types in quaternary protein structure prediction.

Abstract

Deep learning has made significant progress in protein structure prediction, advancing the development of computational biology. However, despite the high accuracy achieved in predicting single-chain structures, a significant number of large homo-oligomeric assemblies exhibit internal symmetry, posing a major challenge in structure determination. The performances of existing deep learning methods are limited since the symmetrical protein assembly usually has a long sequence, making structural computation infeasible. In addition, multiple identical subunits in symmetrical protein complex cause the issue of supervision ambiguity in label assignment, requiring a consistent structure modeling for the training. To tackle these problems, we propose a protein folding framework called SGNet to model protein-protein interactions in symmetrical assemblies. SGNet conducts feature extraction on a single subunit and generates the whole assembly using our proposed symmetry module, which largely mitigates computational problems caused by sequence length. Thanks to the elaborate design of modeling symmetry consistently, we can model all global symmetry types in quaternary protein structure prediction. Extensive experimental results on a benchmark of symmetrical protein complexes further demonstrate the effectiveness of our method.
Paper Structure (29 sections, 18 equations, 5 figures, 5 tables, 16 algorithms)

This paper contains 29 sections, 18 equations, 5 figures, 5 tables, 16 algorithms.

Figures (5)

  • Figure 1: An illustration example of chain index mask generation. $\otimes$ is the symmetry axis.
  • Figure 2: The pipeline of SGNet follows a two-phase, end-to-end process for predicting structure. Initially, the ASU structure is generated using both single and pair representation from a pretrained ASU structure module. Then, the symmetry module expands the ASU to the whole assembly through protein-protein interface information.
  • Figure 3: Performance variation with respect to protein length.
  • Figure 4: (A) and (B) are protein superimposition where the blue symmetry axes are also rendered. In (C) we place the ground truth on right for better visualization. (A) 2qvj, C10 symmetry. (B) 5xx9, O symmetry. (C) 5yl1, I symmetry.
  • Figure 5: Neural architecture of heads.