Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction
Aleix Lafita, Ferran Gonzalez, Mahmoud Hossam, Paul Smyth, Jacob Deasy, Ari Allyn-Feuer, Daniel Seaton, Stephen Young
TL;DR
The paper addresses the challenge of translating protein language model (PLM) predictions into accurate functional assessments of missense variants by leveraging Deep Mutational Scanning (DMS) data. It introduces a Lightweight Normalised Log-odds Ratio (NLR) head that normalises DMS scores across assays and enables end-to-end fine-tuning of PLMs, specifically adapting ESM-family models. Evaluations on held-out proteins and independent benchmarks such as ProteinGym and ClinVar show consistent improvements in correlation and pathogenicity classification, with larger gains for proteins that underperform in zero-shot predictions. The work demonstrates that integrating experimental DMS maps with PLMs is a scalable path to enhance variant effect prediction, while highlighting opportunities to extend the approach to MSA-based PLMs and larger DMS datasets.
Abstract
Protein Language Models (PLMs) have emerged as performant and scalable tools for predicting the functional impact and clinical significance of protein-coding variants, but they still lag experimental accuracy. Here, we present a novel fine-tuning approach to improve the performance of PLMs with experimental maps of variant effects from Deep Mutational Scanning (DMS) assays using a Normalised Log-odds Ratio (NLR) head. We find consistent improvements in a held-out protein test set, and on independent DMS and clinical variant annotation benchmarks from ProteinGym and ClinVar. These findings demonstrate that DMS is a promising source of sequence diversity and supervised training data for improving the performance of PLMs for variant effect prediction.
