MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

Etienne Goffinet; Raghvendra Mall; Ankita Singh; Rahul Kaushik; Filippo Castiglione

MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

Etienne Goffinet, Raghvendra Mall, Ankita Singh, Rahul Kaushik, Filippo Castiglione

TL;DR

The paper tackles predicting TCR-epitope binding affinity, a key step in guiding immunotherapies. It introduces MATE-Pred, a multimodal attention-based predictor that fuses textual AA embeddings with physicochemical descriptors and predicted contact maps for each sequence via early fusion in dual encoders. The approach achieves state-of-the-art performance on large-scale training data and shows robust generalization on a challenging independent test set, with notable gains in MCC and AUC. The work highlights the value of integrating multiple modalities and points to future enhancements with structural data and additional modalities, while providing code and datasets for community use.

Abstract

An accurate binding affinity prediction between T-cell receptors and epitopes contributes decisively to develop successful immunotherapy strategies. Some state-of-the-art computational methods implement deep learning techniques by integrating evolutionary features to convert the amino acid residues of cell receptors and epitope sequences into numerical values, while some other methods employ pre-trained language models to summarize the embedding vectors at the amino acid residue level to obtain sequence-wise representations. Here, we propose a highly reliable novel method, MATE-Pred, that performs multi-modal attention-based prediction of T-cell receptors and epitopes binding affinity. The MATE-Pred is compared and benchmarked with other deep learning models that leverage multi-modal representations of T-cell receptors and epitopes. In the proposed method, the textual representation of proteins is embedded with a pre-trained bi-directional encoder model and combined with two additional modalities: a) a comprehensive set of selected physicochemical properties; b) predicted contact maps that estimate the 3D distances between amino acid residues in the sequences. The MATE-Pred demonstrates the potential of multi-modal model in achieving state-of-the-art performance (+8.4\% MCC, +5.5\% AUC compared to baselines) and efficiently capturing contextual, physicochemical, and structural information from amino acid residues. The performance of MATE-Pred projects its potential application in various drug discovery regimes.

MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

TL;DR

Abstract

Paper Structure (20 sections, 2 equations, 6 figures, 3 tables)

This paper contains 20 sections, 2 equations, 6 figures, 3 tables.

Introduction
Materials and Methods
Training Dataset Collection
Independent Test Set
TCR and Epitope modalities
AA sequence
Physicochemical Features
Contact Map
Multi-Modal architecture
Modality Fusion
Attention-based encoder
Final projection
Experimental setup
Implementation
Evaluation
...and 5 more sections

Figures (6)

Figure 1: Crystal structure of the affinity-enhanced A3A TCR engaging with melanoma- associated antigen 3 (MAGE-A3)-derived peptide presented by HLA-A*01 raman2016direct (generated with data from raman2016direct and visualized with PyMOL).
Figure 2: a) Tertiary / 3D structure of an AA sequence; b) The contact map is a distance $\ell\times \ell$ matrix where $c_{ij}$ value is the distance between the amino acid in position $i$ and $j$ in the 3D space representation.
Figure 3: a) Multi-modal AA sequence encoder. The grey embedder block on the left is pre-trained and its weights are fixed. b) the full architecture, featuring two Multi-modal AA sequence encoders (TCR Encoder and Epitope Encoder) and the final Feed-Forward Network block.
Figure 4: Multi-Modal benchmark results. From left to right: Text-only: uni-modal architecture; PCF-LC: Text + Physicochemical feature (late concat); PCF-EC: Text + Physicochemical features (early concat); CM-CNN-LC: Text + Contact Map (CNN and late concat); CM-CNN-EC: Text + Contact Map (CNN and early concat); CM-LC: Text + Contact Map (late concat); CM-EC: Text + Contact Map (early concat); PCF-CM: Text + Physicochemical features (early concat) + Contact Map (early concat).
Figure 5: Results on the independent test dataset; Text-only WPE is a variant of Text-only with AA string sequences embedding trained from scratch.
...and 1 more figures

MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

TL;DR

Abstract

MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

Authors

TL;DR

Abstract

Table of Contents

Figures (6)