JanusDDG: A Thermodynamics-Compliant Model for Sequence-Based Protein Stability via Two-Fronts Multi-Head Attention
Guido Barducci, Ivan Rossi, Francesco Codicè, Cesare Rollo, Valeria Repetto, Corrado Pancotti, Virginia Iannibelli, Tiziana Sanavia, Piero Fariselli
TL;DR
This work tackles sequence-based prediction of protein stability changes $\Delta\Delta G$ from amino acid sequences alone. It introduces JanusDDG, a bidirectional cross-attention architecture that uses PLM embeddings (ESM2, 650M) to capture sequence-context while enforcing thermodynamic principles such as antisymmetry ($\Delta\Delta G(A,B) = -\Delta\Delta G(B,A)$) and transitivity across mutation pathways. Through a siamese design, antisymmetry is enforced by construction, and a transitivity-focused fine-tuning loss aligns multi-step mutation effects with state-function thermodynamics, yielding physically consistent predictions. Evaluated on blind, low-identity datasets (S669, S461, S96, PTmul-NR) and multi-point benchmarks (S2450, M28, PTmul-D), JanusDDG achieves state-of-the-art or competitive accuracy compared to both sequence-based and some structure-informed methods, while predicting stability changes using only sequence data. The approach holds promise for rapid protein stability assessments and design, particularly when structural data are unavailable or costly to obtain, with robust generalization to multi-residue mutations.
Abstract
Understanding how residue variations affect protein stability is crucial for designing functional proteins and deciphering the molecular mechanisms underlying disease-related mutations. Recent advances in protein language models (PLMs) have revolutionized computational protein analysis, enabling, among other things, more accurate predictions of mutational effects. In this work, we introduce JanusDDG, a deep learning framework that leverages PLM-derived embeddings and a bidirectional cross-attention transformer architecture to predict $ΔΔG$ of single and multiple-residue mutations while simultaneously being constrained to respect fundamental thermodynamic properties, such as antisymmetry and transitivity. Unlike conventional self-attention, JanusDDG computes queries (Q) and values (V) as the difference between wild-type and mutant embeddings, while keys (K) alternate between the two. This cross-interleaved attention mechanism enables the model to capture mutation-induced perturbations while preserving essential contextual information. Experimental results show that JanusDDG achieves state-of-the-art performance in predicting $ΔΔG$ from sequence alone, matching or exceeding the accuracy of structure-based methods for both single and multiple mutations. Code Availability:https://github.com/compbiomed-unito/JanusDDG
