Sequence-Only Prediction of Binding Affinity Changes: A Robust and Interpretable Model for Antibody Engineering
Chen Liu, Mingchen Li, Yang Tan, Wenrui Gou, Guisheng Fan, Bingxin Zhou
TL;DR
This paper tackles predicting mutation-induced binding affinity changes in antibody–antigen complexes without relying on structural data. It introduces ProtAttBA, a cross-attention model that uses frozen pre-trained protein language models to encode four sequences (wt and mutant antibody and antigen) and regresses $\Delta\Delta G_{bind}$ from their representations. Across three open benchmarks and multiple data-splits, ProtAttBA achieves competitive accuracy compared with both sequence- and structure-based baselines, showing robustness when input structures are uncertain. The method also provides interpretability through attention weights that highlight residues contributing to affinity changes, suggesting practical utility for rapid and cost-effective antibody engineering.
Abstract
A pivotal area of research in antibody engineering is to find effective modifications that enhance antibody-antigen binding affinity. Traditional wet-lab experiments assess mutants in a costly and time-consuming manner. Emerging deep learning solutions offer an alternative by modeling antibody structures to predict binding affinity changes. However, they heavily depend on high-quality complex structures, which are frequently unavailable in practice. Therefore, we propose ProtAttBA, a deep learning model that predicts binding affinity changes based solely on the sequence information of antibody-antigen complexes. ProtAttBA employs a pre-training phase to learn protein sequence patterns, following a supervised training phase using labeled antibody-antigen complex data to train a cross-attention-based regressor for predicting binding affinity changes. We evaluated ProtAttBA on three open benchmarks under different conditions. Compared to both sequence- and structure-based prediction methods, our approach achieves competitive performance, demonstrating notable robustness, especially with uncertain complex structures. Notably, our method possesses interpretability from the attention mechanism. We show that the learned attention scores can identify critical residues with impacts on binding affinity. This work introduces a rapid and cost-effective computational tool for antibody engineering, with the potential to accelerate the development of novel therapeutic antibodies.
