Table of Contents
Fetching ...

Unlocking Potential Binders: Multimodal Pretraining DEL-Fusion for Denoising DNA-Encoded Libraries

Chunbin Gu, Mutian He, Hanqun Cao, Guangyong Chen, Chang-yu Hsieh, Pheng Ann Heng

TL;DR

DEL screening is noisy due to nonspecific interactions, and existing encoders struggle with limited chemical diversity and single-scale representations. The authors present MPDF, a Multimodal Pretraining DEL-Fusion model that combines contrastive pretraining across Graph/Text and ECFP/Text representations with a DEL-Fusion network that fuses atomic, submolecular, and molecular information via bilinear interactions. They pretrain on large biochemical datasets to improve generic feature extraction and define an enrichment parameter $R$ reflecting compound activity, optimized through a Bayesian-inspired loss on $P(R|z)$. Evaluated on three noisy DEL datasets (P, A, OA), MPDF outperforms prior methods in denoising and activity prediction, demonstrating enhanced identification of high-affinity molecules and potential improvements to DEL utility in drug discovery.

Abstract

In the realm of drug discovery, DNA-encoded library (DEL) screening technology has emerged as an efficient method for identifying high-affinity compounds. However, DEL screening faces a significant challenge: noise arising from nonspecific interactions within complex biological systems. Neural networks trained on DEL libraries have been employed to extract compound features, aiming to denoise the data and uncover potential binders to the desired therapeutic target. Nevertheless, the inherent structure of DEL, constrained by the limited diversity of building blocks, impacts the performance of compound encoders. Moreover, existing methods only capture compound features at a single level, further limiting the effectiveness of the denoising strategy. To mitigate these issues, we propose a Multimodal Pretraining DEL-Fusion model (MPDF) that enhances encoder capabilities through pretraining and integrates compound features across various scales. We develop pretraining tasks applying contrastive objectives between different compound representations and their text descriptions, enhancing the compound encoders' ability to acquire generic features. Furthermore, we propose a novel DEL-fusion framework that amalgamates compound information at the atomic, submolecular, and molecular levels, as captured by various compound encoders. The synergy of these innovations equips MPDF with enriched, multi-scale features, enabling comprehensive downstream denoising. Evaluated on three DEL datasets, MPDF demonstrates superior performance in data processing and analysis for validation tasks. Notably, MPDF offers novel insights into identifying high-affinity molecules, paving the way for improved DEL utility in drug discovery.

Unlocking Potential Binders: Multimodal Pretraining DEL-Fusion for Denoising DNA-Encoded Libraries

TL;DR

DEL screening is noisy due to nonspecific interactions, and existing encoders struggle with limited chemical diversity and single-scale representations. The authors present MPDF, a Multimodal Pretraining DEL-Fusion model that combines contrastive pretraining across Graph/Text and ECFP/Text representations with a DEL-Fusion network that fuses atomic, submolecular, and molecular information via bilinear interactions. They pretrain on large biochemical datasets to improve generic feature extraction and define an enrichment parameter reflecting compound activity, optimized through a Bayesian-inspired loss on . Evaluated on three noisy DEL datasets (P, A, OA), MPDF outperforms prior methods in denoising and activity prediction, demonstrating enhanced identification of high-affinity molecules and potential improvements to DEL utility in drug discovery.

Abstract

In the realm of drug discovery, DNA-encoded library (DEL) screening technology has emerged as an efficient method for identifying high-affinity compounds. However, DEL screening faces a significant challenge: noise arising from nonspecific interactions within complex biological systems. Neural networks trained on DEL libraries have been employed to extract compound features, aiming to denoise the data and uncover potential binders to the desired therapeutic target. Nevertheless, the inherent structure of DEL, constrained by the limited diversity of building blocks, impacts the performance of compound encoders. Moreover, existing methods only capture compound features at a single level, further limiting the effectiveness of the denoising strategy. To mitigate these issues, we propose a Multimodal Pretraining DEL-Fusion model (MPDF) that enhances encoder capabilities through pretraining and integrates compound features across various scales. We develop pretraining tasks applying contrastive objectives between different compound representations and their text descriptions, enhancing the compound encoders' ability to acquire generic features. Furthermore, we propose a novel DEL-fusion framework that amalgamates compound information at the atomic, submolecular, and molecular levels, as captured by various compound encoders. The synergy of these innovations equips MPDF with enriched, multi-scale features, enabling comprehensive downstream denoising. Evaluated on three DEL datasets, MPDF demonstrates superior performance in data processing and analysis for validation tasks. Notably, MPDF offers novel insights into identifying high-affinity molecules, paving the way for improved DEL utility in drug discovery.
Paper Structure (18 sections, 11 equations, 2 figures, 4 tables)

This paper contains 18 sections, 11 equations, 2 figures, 4 tables.

Figures (2)

  • Figure 1: Diagram of DEL panning experiment. A diverse library of DNA-barcoded compounds is incubated with an immobilized protein target of interest. After incubation, non-binders are removed by washing, and putative binders remain bound to the target. These bound compounds are then eluted, amplified, and sequenced to identify potential hit compounds for further investigation.
  • Figure 2: Multimodal Pretrained DEL-Fusion (MPDF) framework. (a) Pretrain module: Separate encoders embed graph, text, and ECFP into a joint space for activity prediction. (b) DEL denoising module: Graph and ECFP features, representing compound information at different scales, are fed into the DEL-Fusion network to learn pairwise interactions. A decoder generates the joint representation $\textbf{f}$. (c) DEL-Fusion module: Attention map matrix $\mathbf{I}$ is obtained via low-rank interaction modeling of encoded graph features $\mathbf{H}_G$ and ECFP features $\mathbf{H}_E$, measuring interaction intensity at different scales. $\mathbf{I}$ is then pooled to produce the joint representation $\textbf{f}$.