Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

Po-Yu Liang; Xueting Huang; Tibo Duran; Andrew J. Wiemer; Jun Bai

Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

Po-Yu Liang, Xueting Huang, Tibo Duran, Andrew J. Wiemer, Jun Bai

TL;DR

A novel method was proposed that utilized autoencoder shaped models to explore the protein embedding space, and generate novel peptide analogs by leveraging protein language models, and shows significant improvements over baseline models in similarity indicators of peptide structures, descriptors and bioactivities.

Abstract

Generating peptides with desired properties is crucial for drug discovery and biotechnology. Traditional sequence-based and structure-based methods often require extensive datasets, which limits their effectiveness. In this study, we proposed a novel method that utilized autoencoder shaped models to explore the protein embedding space, and generate novel peptide analogs by leveraging protein language models. The proposed method requires only a single sequence of interest, avoiding the need for large datasets. Our results show significant improvements over baseline models in similarity indicators of peptide structures, descriptors and bioactivities. The proposed method validated through Molecular Dynamics simulations on TIGIT inhibitors, demonstrates that our method produces peptide analogs with similar yet distinct properties, highlighting its potential to enhance peptide screening processes.

Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

TL;DR

Abstract

Paper Structure (27 sections, 9 figures, 1 table)

This paper contains 27 sections, 9 figures, 1 table.

Introduction
Related Research
Lab Experiment Based Method
Deep Learning Based Method
Method
Overview of the Proposed Method
Embedding
ProtT5 Embedding
ESM-2 Embedding
Noise
Decoder
ProtT5 Decoder
ESM-2 Decoder
Data & Experiment Setup
Data Source and Filtering
...and 12 more sections

Figures (9)

Figure 1: Method Flow Chart
Figure 2: Morgan Fingerprint Similarities
Figure 3: Sequences QSAR Similarities
Figure 4: RDkit Descriptor Similarities
Figure 5: Similarities & Alignment Score Difference
...and 4 more figures

Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

TL;DR

Abstract

Exploring Latent Space for Generating Peptide Analogs Using Protein Language Models

Authors

TL;DR

Abstract

Table of Contents

Figures (9)