Quantifying Cross-Attention Interaction in Transformers for Interpreting TCR-pMHC Binding
Jiarui Li, Zixiang Yin, Haley Smith, Zhengming Ding, Samuel J. Landry, Ramgopal R. Mettu
TL;DR
This work tackles the interpretability challenge of encoder-decoder transformers in TCR-pMHC binding prediction by introducing Quantifying Cross-Attention Interaction (QCAI). QCAI deconstructs cross-attention into query- and key-input attributions using a GradCAM-style framework and a Moore-Penrose pseudoinverse-based separation, then aggregates explanations across layers. To evaluate explanations rigorously, the authors construct TCR-XAI, a 274-structure benchmark with residue-level ground truth distances, and introduce BRHR as a binding-region accuracy metric. On a state-of-the-art TCR-pMHC model (TULIP), QCAI achieves state-of-the-art interpretability and competitive predictive performance, outperforming encoder-only baselines on multiple metrics (ROC-AUC, LOdds, AOPC, BRHR). The method holds promise for broader application to cross-attention in diverse transformer models and protein–protein interaction tasks.
Abstract
CD8+ "killer" T cells and CD4+ "helper" T cells play a central role in the adaptive immune system by recognizing antigens presented by Major Histocompatibility Complex (pMHC) molecules via T Cell Receptors (TCRs). Modeling binding between T cells and the pMHC complex is fundamental to understanding basic mechanisms of human immune response as well as in developing therapies. While transformer-based models such as TULIP have achieved impressive performance in this domain, their black-box nature precludes interpretability and thus limits a deeper mechanistic understanding of T cell response. Most existing post-hoc explainable AI (XAI) methods are confined to encoder-only, co-attention, or model-specific architectures and cannot handle encoder-decoder transformers used in TCR-pMHC modeling. To address this gap, we propose Quantifying Cross-Attention Interaction (QCAI), a new post-hoc method designed to interpret the cross-attention mechanisms in transformer decoders. Quantitative evaluation is a challenge for XAI methods; we have compiled TCR-XAI, a benchmark consisting of 274 experimentally determined TCR-pMHC structures to serve as ground truth for binding. Using these structures we compute physical distances between relevant amino acid residues in the TCR-pMHC interaction region and evaluate how well our method and others estimate the importance of residues in this region across the dataset. We show that QCAI achieves state-of-the-art performance on both interpretability and prediction accuracy under the TCR-XAI benchmark.
