FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames
Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng
TL;DR
This work tackles the challenge of antibody antigen complex modeling by identifying gradient vanishing in the Frame Aligned Point Error loss when rotation errors are large. It introduces Frame Aligned Frame Error, a geodesic distance based loss that jointly optimizes rotational and translational frame differences, and extends it to a group aware formulation GF2E to handle inter chain pose errors. Through LoRA based parameter efficient fine tuning of AF2 Multimer on antibody antigen data, the approach yields substantial improvements in DockQ metrics, especially for low homology targets. The results demonstrate that reframing inter chain errors as geodesic distances between group frames improves gradient stability and docking accuracy, with broad implications for immune complex modeling and potential extension to non protein components and pre training settings.
Abstract
Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing problem on high-rotational-error targets. To address this fundamental limitation, we propose a novel geodesic loss called Frame Aligned Frame Error (FAFE, denoted as F2E to distinguish from FAPE), which enables the model to better optimize both the rotational and translational errors between two frames. We then prove that F2E can be reformulated as a group-aware geodesic loss, which translates the optimization of the residue-to-residue error to optimizing group-to-group geodesic frame distance. By fine-tuning AF2 with our proposed new loss function, we attain a correct rate of 52.3\% (DockQ $>$ 0.23) on an evaluation set and 43.8\% correct rate on a subset with low homology, with substantial improvement over AF2 by 182\% and 100\% respectively.
