AF2-Mutation: Adversarial Sequence Mutations against AlphaFold2 on Protein Tertiary Structure Prediction
Zhongju Yuan, Tao Shen, Sheng Xu, Leiye Yu, Ruobing Ren, Siqi Sun
TL;DR
This work investigates the robustness of AlphaFold2 (AF2) to sequence mutations by introducing AF2-Mutation, a gradient-free adversarial attack that uses differential evolution to generate three-residue mutations via replacement, deletion, and insertion. The method relies on maximizing the local distance difference test (lDDT) gap between AF2’s mutated-predicted structure and the native structure, with an efficient MSA approximation during evolution and final MSA re-alignment for accurate evaluation. Experiments on CASP14 show that mixed mutation strategies can produce substantial lDDT disruptions, outperforming random baselines and revealing positions with potential biological significance; a SPNS2 case study demonstrates that identified residues can drive plausible conformational changes. The results highlight AF2’s vulnerabilities to simple, discrete sequence edits and offer a practical tool for quickly identifying critical residues and plausible alternative conformations to guide experimental planning and interpretation.
Abstract
Deep learning-based approaches, such as AlphaFold2 (AF2), have significantly advanced protein tertiary structure prediction, achieving results comparable to real biological experimental methods. While AF2 has shown limitations in predicting the effects of mutations, its robustness against sequence mutations remains to be determined. Starting with the wild-type (WT) sequence, we investigate adversarial sequences generated via an evolutionary approach, which AF2 predicts to be substantially different from WT. Our experiments on CASP14 reveal that by modifying merely three residues in the protein sequence using a combination of replacement, deletion, and insertion strategies, the alteration in AF2's predictions, as measured by the Local Distance Difference Test (lDDT), reaches 46.61. Moreover, when applied to a specific protein, SPNS2, our proposed algorithm successfully identifies biologically meaningful residues critical to protein structure determination and potentially indicates alternative conformations, thus significantly expediting the experimental process.
